An Interpretable Machine Learning Approach to Understanding the Relationships between Solar Flares and Source Active Regions

📅 2025-02-20
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study investigates the decisive role of active region (AR) physical properties in triggering C-class or stronger solar flares to support early, precise space weather forecasting. Leveraging multi-source solar observational data from 2011–2021, we develop an interpretable random forest model for high-accuracy binary classification of flare occurrence probability. Feature importance analysis identifies AR_Type_Today as the strongest predictor and Hale_Class_Yesterday as the weakest; NoS_Difference emerges as a consistently influential driver in both global and local explanations. We further employ SHAP and LIME to elucidate the mechanistic roles of key features. Our work establishes the first machine learning paradigm that jointly achieves high predictive performance and model interpretability for solar flare forecasting—bridging empirical data-driven methods with physical understanding. This framework advances both flare physics modeling and operational space weather prediction.

Technology Category

Application Category

📝 Abstract
Solar flares are defined as outbursts on the surface of the Sun. They occur when energy accumulated in magnetic fields enclosing solar active regions (ARs) is abruptly expelled. Solar flares and associated coronal mass ejections are sources of space weather that adversely impact devices at or near Earth, including the obstruction of high-frequency radio waves utilized for communication and the deterioration of power grid operations. Tracking and delivering early and precise predictions of solar flares is essential for readiness and catastrophe risk mitigation. This paper employs the random forest (RF) model to address the binary classification task, analyzing the links between solar flares and their originating ARs with observational data gathered from 2011 to 2021 by SolarMonitor.org and the XRT flare database. We seek to identify the physical features of a source AR that significantly influence its potential to trigger>=C-class flares. We found that the features of AR_Type_Today, Hale_Class_Yesterday are the most and the least prepotent features, respectively. NoS_Difference has a remarkable effect in decision-making in both global and local interpretations.
Problem

Research questions and friction points this paper is trying to address.

Understanding solar flare origins
Predicting solar flare impacts
Analyzing active region features
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses random forest model
Analyzes solar flare data
Identifies key AR features
🔎 Similar Papers
No similar papers found.
H
Huseyin Cavus
Department of Physics, Canakkale Onsekiz Mart University, 17110 Canakkale, Turkey
Jason T. L. Wang
Jason T. L. Wang
College of Computing, New Jersey Institute of Technology, Newark, NJ 07102, USA
T
Teja P. S. Singampalli
College of Computing, New Jersey Institute of Technology, Newark, NJ 07102, USA
G
G. Ç. Coban
Department of Physics, Canakkale Onsekiz Mart University, 17110 Canakkale, Turkey
Hongyang Zhang
Hongyang Zhang
Assistant Professor of Computer Science, University of Waterloo
Machine LearningInference AccelerationAI Security
A
A. Raheem
Department of Physics, Canakkale Onsekiz Mart University, 17110 Canakkale, Turkey
Haimin Wang
Haimin Wang
New Jersey Institute of Technology
PhysicsSpace Science