Published November 15, 2024
| Version v1
Journal article
Open
Using Explainable AI and Transfer Learning to Understand and Predict the Maintenance of Atlantic Blocking With Limited Observational Data
- 1. New York University
- 2. Massachusetts Institute of Technology
- 3. University of Chicago
Description
Blocking events are an important cause of extreme weather, especially long-lasting blocking events that trap weather systems in place. The duration of blocking events is, however, underestimated in climate models. Explainable Artificial Intelligence are a class of data analysis methods that can help identify physical causes of prolonged blocking events and diagnose model deficiencies. We demonstrate this approach on an idealized quasigeostrophic (QG) model developed by Marshall and Molteni (1993), https://doi.org/10.1175/1520-0469(1993)050<1792:taduop>2.0.co;2. We train a convolutional neural network (CNN), and subsequently, build a sparse predictive model for the persistence of Atlantic blocking, conditioned on an initial high-pressure anomaly. Shapley Additive ExPlanation (SHAP) analysis reveals that high-pressure anomalies in the American Southeast and North Atlantic, separated by a trough over Atlantic Canada, contribute significantly to prediction of sustained blocking events in the Atlantic region. This agrees with previous work that identified precursors in the same regions via wave train analysis. When we apply the same CNN to blockings in the ERA5 atmospheric reanalysis, there is insufficient data to accurately predict persistent blocks. We partially overcome this limitation by pre-training the CNN on the plentiful data of the Marshall-Molteni model, and then using Transfer learning (TL) to achieve better predictions than direct training. SHAP analysis before and after TL allows a comparison between the predictive features in the reanalysis and the QG model, quantifying dynamical biases in the idealized model. This work demonstrates the potential for machine learning methods to extract meaningful precursors of extreme weather events and achieve better prediction using limited observational data.
Data availability
The data from the Marshall-Molteni model were generated using a Fortran code provided by Valerio Lucarini and Andrey Gritsun (Lucarini & Gritsun, 2020). The Fortran code, along with the Python code for computing SHAP values, TL and producing plots is publicly available in the open repository (Zhang, 2024). SHAP values were computed using the Python package DeepSHAP (Chen, 2022). The ERA5 reanalysis data sets from ECWMF were used for data preprocessing and ML model training and testing (Hersbach et al., 2020).Files
Using-Explainable-AI-and-Transfer-Learning-to-Understand-and-Predict-the-Maintenance.pdf
Files
(4.0 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:356953c8d0626d886d56e2d404e616a9
|
4.0 MB | Preview Download |
Additional details
Identifiers
- DOI
- 10.1029/2024JH000243
- Other
- oai:uchicago.tind.io:14026
Funding
- Army Research Office
- W911NF-22-2-0124
- National Science Foundation
- OAC-2004572
- MIT
- Climate Grand Challenge on Weather and Climate Extremes
- Virtual Earth Systems Research Institute, Schmidt Sciences