🤖 AI Summary
To address the limited interpretability of deep learning models in forecasting coupled solar storm events (solar flares and coronal mass ejections, CMEs), this paper proposes the first interpretable solar storm prediction framework tailored for LSTM architectures. Methodologically, we design an attention-augmented LSTM model to process multi-source time-series data from solar active regions and, for the first time in solar physics, integrate model-agnostic post-hoc explanation techniques—particularly SHAP—to enable attribution analysis and visualization of critical time steps and physically meaningful features. Our contributions are threefold: (1) systematic integration of interpretability into solar physics forecasting; (2) identification of key pre-CME magnetohydrodynamic evolution patterns and dominant predictive features—such as longitudinal magnetic field gradients and duration of shear motion—while maintaining high predictive accuracy; and (3) provision of trustworthy, traceable decision-support evidence for space weather forecasting.
📝 Abstract
A deep learning model is often considered a black-box model, as its internal workings tend to be opaque to the user. Because of the lack of transparency, it is challenging to understand the reasoning behind the model's predictions. Here, we present an approach to making a deep learning-based solar storm prediction model interpretable, where solar storms include solar flares and coronal mass ejections (CMEs). This deep learning model, built based on a long short-term memory (LSTM) network with an attention mechanism, aims to predict whether an active region (AR) on the Sun's surface that produces a flare within 24 hours will also produce a CME associated with the flare. The crux of our approach is to model data samples in an AR as time series and use the LSTM network to capture the temporal dynamics of the data samples. To make the model's predictions accountable and reliable, we leverage post hoc model-agnostic techniques, which help elucidate the factors contributing to the predicted output for an input sequence and provide insights into the model's behavior across multiple sequences within an AR. To our knowledge, this is the first time that interpretability has been added to an LSTM-based solar storm prediction model.