Searth Transformer: A Transformer Architecture Incorporating Earth's Geospheric Physical Priors for Global Mid-Range Weather Forecasting

📅 2026-01-14
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses critical limitations in existing Transformer-based weather forecasting models, which neglect Earth’s spherical geometry, zonal periodicity, and meridional boundary conditions, while also suffering from high computational costs and severe error accumulation under conventional autoregressive training. To overcome these challenges, the authors propose the Searth Transformer architecture, which uniquely integrates geophysical priors into a windowed self-attention mechanism, alongside a low-overhead relay autoregressive (RAR) fine-tuning strategy. The resulting YanTian model, evaluated at 1° resolution, outperforms the ECMWF high-resolution system, achieving a Z500 anomaly correlation skill horizon of 10.3 days. Moreover, it reduces computational cost by approximately 200-fold compared to standard autoregressive approaches, substantially enhancing both efficiency and accuracy in medium-range global weather prediction.

Technology Category

Application Category

📝 Abstract
Accurate global medium-range weather forecasting is fundamental to Earth system science. Most existing Transformer-based forecasting models adopt vision-centric architectures that neglect the Earth's spherical geometry and zonal periodicity. In addition, conventional autoregressive training is computationally expensive and limits forecast horizons due to error accumulation. To address these challenges, we propose the Shifted Earth Transformer (Searth Transformer), a physics-informed architecture that incorporates zonal periodicity and meridional boundaries into window-based self-attention for physically consistent global information exchange. We further introduce a Relay Autoregressive (RAR) fine-tuning strategy that enables learning long-range atmospheric evolution under constrained memory and computational budgets. Based on these methods, we develop YanTian, a global medium-range weather forecasting model. YanTian achieves higher accuracy than the high-resolution forecast of the European Centre for Medium-Range Weather Forecasts and performs competitively with state-of-the-art AI models at one-degree resolution, while requiring roughly 200 times lower computational cost than standard autoregressive fine-tuning. Furthermore, YanTian attains a longer skillful forecast lead time for Z500 (10.3 days) than HRES (9 days). Beyond weather forecasting, this work establishes a robust algorithmic foundation for predictive modeling of complex global-scale geophysical circulation systems, offering new pathways for Earth system science.
Problem

Research questions and friction points this paper is trying to address.

weather forecasting
Transformer architecture
spherical geometry
zonal periodicity
autoregressive training
Innovation

Methods, ideas, or system contributions that make the work stand out.

Searth Transformer
zonal periodicity
Relay Autoregressive fine-tuning
physics-informed architecture
global weather forecasting
🔎 Similar Papers
No similar papers found.
Tianye Li
Tianye Li
NVIDIA Research
Computer GraphicsComputer VisionDigital Humans
Q
Qi Liu
State Key Laboratory of Earth System Numerical Modeling and Application, Institute of Atmospheric Physics, Chinese Academy of Sciences, Beijing 100029, China
Hao Li
Hao Li
FUDAN UNIVERSITY,DAMO@ALIBABA
Computer VisionDeep LearningAI4S
L
Lei Chen
Shanghai Academy of Artificial Intelligence for Science, Shanghai, 200232, China
W
Wencong Cheng
Beijing Aviation Meteorological Institute, Beijing 100085, China
Fei Zheng
Fei Zheng
Institute of Atmospheric Physics, CAS
ENSOData AssimilationEnsemble Prediction
X
Xiangao Xia
Laboratory of Middle Atmosphere and Global Environment Observation (LAGEO), Institute of Atmospheric Physics, Chinese Academy of Sciences, Beijing 100029, China
Ya Wang
Ya Wang
Institute of psychology, Chinese Academy of Sciences
psychologypsychiatrycognitive neuroscience
G
Gang Huang
State Key Laboratory of Earth System Numerical Modeling and Application, Institute of Atmospheric Physics, Chinese Academy of Sciences, Beijing 100029, China
Weiwei Wang
Weiwei Wang
Anhui University
micromagnetic simulation
X
Xuan Tong
Jiangsu Key Laboratory of Intelligent Weather Forecasting and Applications Based on Big Data, Nanjing University of Information Science and Technology, Nanjing 210044, China
Z
Z. Zu
State Key Laboratory of Satellite Ocean Environment Dynamics, National Marine Environmental Forecasting Center, Beijing 100081, China
Y
Yi Fang
Temperate East Asia Research Center on Global Change, Institute of Atmospheric Physics, Chinese Academy of Sciences, Beijing 100029, China
S
Shenming Fu
State Key Laboratory of Earth System Numerical Modeling and Application, Institute of Atmospheric Physics, Chinese Academy of Sciences, Beijing 100029, China
Jiang Jiang
Jiang Jiang
University of Nebraska Medical Center
biomaterialsdrug delivery
Haochen Li
Haochen Li
Tsinghua university
cell-cell communicationsingle-cell genomicsspatial transcriptomics
Mingxin Li
Mingxin Li
Japan Society for the Promotion of Science (JSPS) Research Fellow, The University of Tokyo
Digital twinrenewable energyoperation and maintenance
Jiangjiang Xia
Jiangjiang Xia
Institute of Atmospheric Physics, Chinese Academy of Science
machine learningweather forecast