Crowdsourcing the Frontier: Advancing Hybrid Physics-ML Climate Simulation via $50,000 Kaggle Competition

📅 2025-11-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Machine learning (ML) parameterizations for climate modeling often suffer from online instability and inconsistent performance when coupled to full-physics climate models. Method: This study leverages a $50,000 Kaggle competition to crowdsource surrogate models for subgrid-scale physical processes, using the ClimSim dataset. Multiple deep learning architectures are designed and rigorously evaluated via online coupling to an interactive climate model featuring comprehensive cloud microphysics. Contribution/Results: All top-performing competition models achieve long-term online stability. Expanding input variables significantly improves predictive accuracy. Several architectures attain state-of-the-art (SOTA) performance on key metrics—including zonal-mean bias and global root-mean-square error—while exhibiting strong offline–online consistency. This work constitutes the first systematic demonstration of the feasibility and reproducibility of the crowdsourcing paradigm for climate ML parameterization. It establishes a viable pathway toward high-resolution, computationally efficient, and reliable long-term climate prediction.

Technology Category

Application Category

📝 Abstract
Subgrid machine-learning (ML) parameterizations have the potential to introduce a new generation of climate models that incorporate the effects of higher-resolution physics without incurring the prohibitive computational cost associated with more explicit physics-based simulations. However, important issues, ranging from online instability to inconsistent online performance, have limited their operational use for long-term climate projections. To more rapidly drive progress in solving these issues, domain scientists and machine learning researchers opened up the offline aspect of this problem to the broader machine learning and data science community with the release of ClimSim, a NeurIPS Datasets and Benchmarks publication, and an associated Kaggle competition. This paper reports on the downstream results of the Kaggle competition by coupling emulators inspired by the winning teams' architectures to an interactive climate model (including full cloud microphysics, a regime historically prone to online instability) and systematically evaluating their online performance. Our results demonstrate that online stability in the low-resolution, real-geography setting is reproducible across multiple diverse architectures, which we consider a key milestone. All tested architectures exhibit strikingly similar offline and online biases, though their responses to architecture-agnostic design choices (e.g., expanding the list of input variables) can differ significantly. Multiple Kaggle-inspired architectures achieve state-of-the-art (SOTA) results on certain metrics such as zonal mean bias patterns and global RMSE, indicating that crowdsourcing the essence of the offline problem is one path to improving online performance in hybrid physics-AI climate simulation.
Problem

Research questions and friction points this paper is trying to address.

Develop stable machine-learning subgrid parameterizations for climate models
Address online instability in hybrid physics-ML climate simulations
Improve long-term climate projection accuracy via crowdsourced solutions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hybrid physics-ML parameterizations for climate models
Crowdsourced Kaggle competition to solve offline issues
Coupling winning architectures to interactive climate simulation
🔎 Similar Papers
No similar papers found.
J
Jerry Lin
Department of Earth System Sciences, University of California at Irvine, Irvine, CA, USA
Zeyuan Hu
Zeyuan Hu
UCLA
Tom Beucler
Tom Beucler
Assistant Professor, University of Lausanne
Atmospheric PhysicsClimate InformaticsScientific Machine LearningTropical Meteorology
K
Katherine Frields
Department of Earth System Sciences, University of California at Irvine, Irvine, CA, USA
Hannah Christensen
Hannah Christensen
University of Oxford
Weather & Climate PredictionUncertainty quantificationMachine LearningModel development
W
Walter Hannah
Lawrence Livermore National Laboratory
H
Helge Heuer
Deutsches Zentrum für Luft- und Raumfahrt, Institut für Physik der Atmosphäre, Oberpfaffenhofen, Germany
Peter Ukkonen
Peter Ukkonen
Department of Physics, University of Oxford, Oxford, United Kingdom
L
Laura A. Mansfield
Department of Physics, University of Oxford, Oxford, United Kingdom
T
Tian Zheng
Department of Statistics, Columbia University, New York, NY, USA
L
Liran Peng
Department of Earth System Sciences, University of California at Irvine, Irvine, CA, USA
Ritwik Gupta
Ritwik Gupta
Postdoctoral Researcher, University of California, Berkeley
Computer VisionHumanitarian AssistanceDisaster ResponsePublic Policy
Pierre Gentine
Pierre Gentine
Professor @ Columbia University - Director NSF LEAP STC
climate changeclimate modelingecohydrologymachine learning
Y
Yusef Al-Naher
Z Lab, China
M
Mingjiang Duan
Z Lab, China
K
Kyo Hattori
ABEJA Inc., Japan
W
Weiliang Ji
Z Lab, China
C
Chunhan Li
Z Lab, China
K
Kippei Matsuda
Kawasaki Heavy Industries, Ltd., Japan
N
Naoki Murakami
DeNA Co., Ltd
S
Shlomo Ron
Z Lab, China
M
Marec Serlin
Uber Technologies, Inc.
H
Hongjian Song
Z Lab, China
Y
Yuma Tanabe
Z Lab, China
Daisuke Yamamoto
Daisuke Yamamoto
Z Lab, China