Surya: Foundation Model for Heliophysics

📅 2025-08-18
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Solar physics confronts challenges including task-specific modeling, scarcity of labeled data, and poor generalization. To address these, we introduce the first foundational model for solar physics: a spatiotemporal Transformer architecture (366M parameters) that jointly models full-resolution SDO/AIA and HMI multi-channel remote-sensing observations. Our method innovates with time-advancing prediction as the primary self-supervised pretraining objective and integrates spectral gating with hybrid long- and short-range attention mechanisms. We further employ autoregressive rollout pretraining and LoRA-based fine-tuning. Evaluated in zero-shot settings, the model successfully forecasts solar dynamical evolution and solar flare onset. It also achieves state-of-the-art performance on downstream tasks—including solar wind forecasting and active region segmentation—demonstrating its capacity to learn universal, physically grounded representations of solar evolutionary dynamics.

Technology Category

Application Category

📝 Abstract
Heliophysics is central to understanding and forecasting space weather events and solar activity. Despite decades of high-resolution observations from the Solar Dynamics Observatory (SDO), most models remain task-specific and constrained by scarce labeled data, limiting their capacity to generalize across solar phenomena. We introduce Surya, a 366M parameter foundation model for heliophysics designed to learn general-purpose solar representations from multi-instrument SDO observations, including eight Atmospheric Imaging Assembly (AIA) channels and five Helioseismic and Magnetic Imager (HMI) products. Surya employs a spatiotemporal transformer architecture with spectral gating and long--short range attention, pretrained on high-resolution solar image forecasting tasks and further optimized through autoregressive rollout tuning. Zero-shot evaluations demonstrate its ability to forecast solar dynamics and flare events, while downstream fine-tuning with parameter-efficient Low-Rank Adaptation (LoRA) shows strong performance on solar wind forecasting, active region segmentation, solar flare forecasting, and EUV spectra. Surya is the first foundation model in heliophysics that uses time advancement as a pretext task on full-resolution SDO data. Its novel architecture and performance suggest that the model is able to learn the underlying physics behind solar evolution.
Problem

Research questions and friction points this paper is trying to address.

Develops foundation model for solar activity forecasting
Addresses limited generalization in task-specific heliophysics models
Learns solar physics from multi-instrument SDO observations
Innovation

Methods, ideas, or system contributions that make the work stand out.

Surya foundation model with spatiotemporal transformer architecture
Pretrained on solar image forecasting tasks
Uses Low-Rank Adaptation for efficient fine-tuning
🔎 Similar Papers
No similar papers found.
S
Sujit Roy
Earth System Science Center, University of Alabama in Huntsville, AL, USA; NASA Marshall Space Flight Center, Huntsville, AL, USA
Johannes Schmude
Johannes Schmude
IBM Research
Rohit Lal
Rohit Lal
Earth System Science Center, University of Alabama in Huntsville, AL, USA
Vishal Gaur
Vishal Gaur
Earth System Science Center, University of Alabama in Huntsville, AL, USA
M
Marcus Freitag
IBM Research
Julian Kuehnert
Julian Kuehnert
IBM Research
Geospatial AIFoundation ModelsGeophysics
T
Theodore van Kessel
IBM Research
D
Dinesha V. Hegde
Department of Space Science, University of Alabama in Huntsville, AL, USA; Center for Space Plasma and Aeronomic Research (CSPAR), University of Alabama in Huntsville, AL, USA
A
Andrés Muñoz-Jaramillo
Southwest Research Institute
Johannes Jakubik
Johannes Jakubik
Research Scientist @ IBM Research Europe
AI for Climate ImpactDeep learningMulti-modality
E
Etienne Vos
IBM Research
K
Kshitiz Mandal
Earth System Science Center, University of Alabama in Huntsville, AL, USA
A
Ata Akbari Asanjan
Research Institute for Advanced Computer Science, Universities Space Research Association, USA
J
Joao Lucas de Sousa Almeida
IBM Research
A
Amy Lin
Earth System Science Center, University of Alabama in Huntsville, AL, USA
T
Talwinder Singh
Georgia State University
K
Kang Yang
Georgia State University
Chetraj Pandey
Chetraj Pandey
Assistant Professor, Department of Computer Science, Texas Christian University
Explainable Deep LearningContinual LearningSpace Weather
J
Jinsu Hong
Georgia State University
B
Berkay Aydin
Georgia State University
T
Thorsten Kurth
NVIDIA Corp., Santa Clara, USA; Caltech, Pasadena, USA
R
Ryan McGranaghan
NASA Jet Propulsion Laboratory
S
Spiridon Kasapis
Princeton University
Vishal Upendran
Vishal Upendran
Research Scientist, SETI Institute / LMSAL
Astronomy and AstrophysicsDeep learning.
S
Shah Bahauddin
Laboratory for Atmospheric and Space Physics, University of Colorado Boulder