Hierarchical Testing of a Hybrid Machine Learning-Physics Global Atmosphere Model

📅 2026-02-11
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study systematically evaluates the reliability and sources of bias in the hybrid machine learning–physics model NeuralGCM across multiple timescales and under out-of-distribution forcings, such as uniform warming. Through a unified experimental framework encompassing weather-scale forecasts, interannual variability driven by El Niño SST forcing, and idealized warming scenarios—augmented with cyclone tracking and teleconnection analyses—the work presents the first hierarchical, multi-scale assessment of a hybrid climate model under diverse forcing conditions. Results show that NeuralGCM closely reproduces key features of traditional physics-based models in cyclone propagation, teleconnection responses, and large-scale circulation changes under warming. However, it exhibits systematic biases, including overestimated cyclone spatial extent, phase errors in wave trains, and insufficient stratospheric response, thereby revealing both the promise and current limitations of hybrid modeling paradigms.

Technology Category

Application Category

📝 Abstract
Machine learning (ML)-based models have demonstrated high skill and computational efficiency, often outperforming conventional physics-based models in weather and subseasonal predictions. While prior studies have assessed their fidelity in capturing synoptic-scale atmospheric dynamics, their performance across timescales and under out-of-distribution forcing, such as +3K or +4K uniform-warming forcings, and the sources of biases remain elusive, to establish the model reliability for Earth science. Here, we design three sets of experiments targeting synoptic-scale phenomena, interannual variability, and out-of-distribution uniform-warming forcings. We evaluate the Neural General Circulation Model (NeuralGCM), a hybrid model integrating a dynamical core with ML-based component, against observations and physics-based Earth system models (ESMs). At the synoptic scale, NeuralGCM captures the evolution and propagation of extratropical cyclones with performance comparable to ESMs. At the interannual scale, when forced by El Ni\~no-Southern Oscillation sea surface temperature (SST) anomalies, NeuralGCM successfully reproduces associated teleconnection patterns but exhibits deficiencies in capturing nonlinear response. Under out-of-distribution uniform-warming forcings, NeuralGCM simulates similar responses in global-average temperature and precipitation and reproduces large-scale tropospheric circulation features similar to those in ESMs. Notable weaknesses include overestimating the tracks and spatial extent of extratropical cyclones, biases in the teleconnected wave train triggered by tropical SST anomalies, and differences in upper-level warming and stratospheric circulation responses to SST warming compared to physics-based ESMs. The causes of these weaknesses were explored.
Problem

Research questions and friction points this paper is trying to address.

hybrid ML-physics model
atmospheric dynamics
out-of-distribution forcing
model bias
climate reliability
Innovation

Methods, ideas, or system contributions that make the work stand out.

hybrid ML-physics modeling
hierarchical evaluation
out-of-distribution forcing
NeuralGCM
atmospheric teleconnection
🔎 Similar Papers
No similar papers found.
Z
Ziming Chen
Atmospheric, Climate, & Earth Sciences (ACES) Division, Pacific Northwest National Laboratory, Richland, Washington, USA
L. Ruby Leung
L. Ruby Leung
Pacific Northwest National Laboratory
climateprecipitationcloudextremeaerosol
W
Wenyu Zhou
Atmospheric, Climate, & Earth Sciences (ACES) Division, Pacific Northwest National Laboratory, Richland, Washington, USA
Jian Lu
Jian Lu
Shenzhen University
Signal processingImage processingMachine Learning
S
Sandro W. Lubis
Atmospheric, Climate, & Earth Sciences (ACES) Division, Pacific Northwest National Laboratory, Richland, Washington, USA
Y
Ye Liu
Atmospheric, Climate, & Earth Sciences (ACES) Division, Pacific Northwest National Laboratory, Richland, Washington, USA
C
Chuan-Chieh Chang
Atmospheric, Climate, & Earth Sciences (ACES) Division, Pacific Northwest National Laboratory, Richland, Washington, USA
B
Bryce E. Harrop
Atmospheric, Climate, & Earth Sciences (ACES) Division, Pacific Northwest National Laboratory, Richland, Washington, USA
Ya Wang
Ya Wang
Institute of psychology, Chinese Academy of Sciences
psychologypsychiatrycognitive neuroscience
M
Mingshi Yang
Department of Climate, Meteorology, and Atmospheric Sciences, University of Illinois Urbana-Champaign, 1301 W. Green Street, Urbana, IL 61801, United States of America
Gan Zhang
Gan Zhang
Guangzhou Institute of Geochemistry, CAS
Yun Qian
Yun Qian
PNNL
climateatmospheric sciencesair pollution