Published November 15, 2024 | Version v1
Journal article Open

Using Explainable AI and Transfer Learning to Understand and Predict the Maintenance of Atlantic Blocking With Limited Observational Data

  • 1. New York University
  • 2. Massachusetts Institute of Technology
  • 3. University of Chicago

Description

Blocking events are an important cause of extreme weather, especially long-lasting blocking events that trap weather systems in place. The duration of blocking events is, however, underestimated in climate models. Explainable Artificial Intelligence are a class of data analysis methods that can help identify physical causes of prolonged blocking events and diagnose model deficiencies. We demonstrate this approach on an idealized quasigeostrophic (QG) model developed by Marshall and Molteni (1993), https://doi.org/10.1175/1520-0469(1993)050<1792:taduop>2.0.co;2. We train a convolutional neural network (CNN), and subsequently, build a sparse predictive model for the persistence of Atlantic blocking, conditioned on an initial high-pressure anomaly. Shapley Additive ExPlanation (SHAP) analysis reveals that high-pressure anomalies in the American Southeast and North Atlantic, separated by a trough over Atlantic Canada, contribute significantly to prediction of sustained blocking events in the Atlantic region. This agrees with previous work that identified precursors in the same regions via wave train analysis. When we apply the same CNN to blockings in the ERA5 atmospheric reanalysis, there is insufficient data to accurately predict persistent blocks. We partially overcome this limitation by pre-training the CNN on the plentiful data of the Marshall-Molteni model, and then using Transfer learning (TL) to achieve better predictions than direct training. SHAP analysis before and after TL allows a comparison between the predictive features in the reanalysis and the QG model, quantifying dynamical biases in the idealized model. This work demonstrates the potential for machine learning methods to extract meaningful precursors of extreme weather events and achieve better prediction using limited observational data.

Data availability

The data from the Marshall-Molteni model were generated using a Fortran code provided by Valerio Lucarini and Andrey Gritsun (Lucarini & Gritsun, 2020). The Fortran code, along with the Python code for computing SHAP values, TL and producing plots is publicly available in the open repository (Zhang, 2024). SHAP values were computed using the Python package DeepSHAP (Chen, 2022). The ERA5 reanalysis data sets from ECWMF were used for data preprocessing and ML model training and testing (Hersbach et al., 2020).

Files

Using-Explainable-AI-and-Transfer-Learning-to-Understand-and-Predict-the-Maintenance.pdf

Additional details

Identifiers

DOI
10.1029/2024JH000243
Other
oai:uchicago.tind.io:14026

Funding

Army Research Office
W911NF-22-2-0124
National Science Foundation
OAC-2004572
MIT
Climate Grand Challenge on Weather and Climate Extremes
Virtual Earth Systems Research Institute, Schmidt Sciences

UChicago Information

Division(s)
Physical Sciences Division
Department(s)
Geophysical Sciences