AGCD: Agent-Guided Cross-Modal Decoding for Weather Forecasting

📅 2026-03-16
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Autoregressive weather forecasting often suffers from large-scale structural distortions and physical inconsistencies due to error accumulation across single-step predictions. To address this, this work proposes a plug-and-play prior injection paradigm during decoding, introducing for the first time a state-adaptive physical prior generation mechanism grounded in multi-agent and multimodal large language models (MLLMs). The approach further incorporates region-aware multiscale tokenization and cross-modal regional interaction strategies within the decoder, enabling controllable and reusable prior guidance without modifying the backbone architecture. Experiments on WeatherBench demonstrate consistent improvements in 6-hour forecast accuracy across multiple resolutions and backbone models. Moreover, under strict causal autoregressive inference over 48 hours, the method significantly mitigates early error propagation, thereby enhancing long-term structural consistency and stability.

Technology Category

Application Category

📝 Abstract
Accurate weather forecasting is more than grid-wise regression: it must preserve coherent synoptic structures and physical consistency of meteorological fields, especially under autoregressive rollouts where small one-step errors can amplify into structural bias. Existing physics-priors approaches typically impose global, once-for-all constraints via architectures, regularization, or NWP coupling, offering limited state-adaptive and sample-specific controllability at deployment. To bridge this gap, we propose Agent-Guided Cross-modal Decoding (AGCD), a plug-and-play decoding-time prior-injection paradigm that derives state-conditioned physics-priors from the current multivariate atmosphere and injects them into forecasters in a controllable and reusable way. Specifically, We design a multi-agent meteorological narration pipeline to generate state-conditioned physics-priors, utilizing MLLMs to extract various meteorological elements effectively. To effectively apply the priors, AGCD further introduce cross-modal region interaction decoding that performs region-aware multi-scale tokenization and efficient physics-priors injection to refine visual features without changing the backbone interface. Experiments on WeatherBench demonstrate consistent gains for 6-hour forecasting across two resolutions (5.625 degree and 1.40625 degree) and diverse backbones (generic and weather-specialized), including strictly causal 48-hour autoregressive rollouts that reduce early-stage error accumulation and improve long-horizon stability.
Problem

Research questions and friction points this paper is trying to address.

weather forecasting
physical consistency
autoregressive rollout
synoptic structures
state-adaptive control
Innovation

Methods, ideas, or system contributions that make the work stand out.

Agent-Guided Decoding
Physics-Informed Forecasting
Cross-Modal Interaction
Multimodal LLMs
Autoregressive Weather Prediction
J
Jing Wu
Sun Yat-sen University, Zhuhai, China
Yang Liu
Yang Liu
University of Hong Kong
Macro-FinanceInternational Finance
L
Lin Zhang
Jiangxi Science and Technology Normal University, Nanchang, China
J
Junbo Zeng
Sun Yat-sen University, Zhuhai, China
J
Jiabin Wang
Sun Yat-sen University, Zhuhai, China
Z
Zi Ye
Sun Yat-sen University, Zhuhai, China
G
Guowen Li
Sun Yat-sen University, Zhuhai, China
Shilei Cao
Shilei Cao
Sun Yat-sen University
machine learningartificial intelligencecomputer visiontransfer learningSpatio-Temporal
Jiashun Cheng
Jiashun Cheng
Hong Kong University of Science and Technology
machine learning
Fang Wang
Fang Wang
Postdoc, Stanford University
Reading acquisitiondyslexiacross-linguistic researchbilingualismcognitive neuroscience
M
Meng Jin
Huawei Technologies Co., Ltd, China
Y
Yerong Feng
Guangdong-Hong Kong-Macao Greater Bay Area Weather Research Center for Monitoring Warning and Forecasting, China
Hong Cheng
Hong Cheng
Professor, The Chinese University of Hong Kong
Data MiningDatabaseMachine Learning
Y
Yutong Lu
National Supercomputing Center in Shenzhen, Shenzhen, China
Haohuan Fu
Haohuan Fu
Tsinghua University
J
Juepeng Zheng
Sun Yat-sen University, Zhuhai, China