Local Off-Grid Weather Forecasting with Multi-Modal Earth Observation Data

📅 2024-10-16
📈 Citations: 5
Influential: 0
📄 PDF
🤖 AI Summary
Large-scale gridded weather forecasts often fail to resolve near-surface local meteorological conditions, limiting their utility in high-temporal-demand applications such as wildfire management and renewable energy dispatch. To address this, we propose an end-to-end, site-specific fine-grained forecasting framework. Our approach introduces a novel station-level multimodal tokenization scheme that jointly encodes gridded forecasts and localized in-situ observations. We further design a spatiotemporal self-attention mechanism enabling dynamic cross-station information aggregation and—critically—demonstrate for the first time that historical observations can induce phase transitions in forecast evolution, substantially improving nowcasting through medium-range prediction accuracy. Evaluated across multiple stations in the U.S. Northeast, our method consistently outperforms leading data-driven and physics-based models, reducing key error metrics by up to 80%. This advances reliable decision support for high-risk operational scenarios.

Technology Category

Application Category

📝 Abstract
Urgent applications like wildfire management and renewable energy generation require precise, localized weather forecasts near the Earth's surface. However, forecasts produced by machine learning models or numerical weather prediction systems are typically generated on large-scale regular grids, where direct downscaling fails to capture fine-grained, near-surface weather patterns. In this work, we propose a multi-modal transformer model trained end-to-end to downscale gridded forecasts to off-grid locations of interest. Our model directly combines local historical weather observations (e.g., wind, temperature, dewpoint) with gridded forecasts to produce locally accurate predictions at various lead times. Multiple data modalities are collected and concatenated at station-level locations, treated as a token at each station. Using self-attention, the token corresponding to the target location aggregates information from its neighboring tokens. Experiments using weather stations across the Northeastern United States show that our model outperforms a range of data-driven and non-data-driven off-grid forecasting methods. They also reveal that direct input of station data provides a phase shift in local weather forecasting accuracy, reducing the prediction error by up to 80% compared to pure gridded data based models. This approach demonstrates how to bridge the gap between large-scale weather models and locally accurate forecasts to support high-stakes, location-sensitive decision-making.
Problem

Research questions and friction points this paper is trying to address.

Downscaling gridded forecasts to off-grid locations
Combining local observations with gridded forecasts
Improving near-surface weather prediction accuracy
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-modal transformer model for local forecasting
Combines local observations with gridded forecasts
Self-attention aggregates neighboring station data
🔎 Similar Papers
No similar papers found.
Q
Qidong Yang
Massachusetts Institute of Technology
J
Jonathan Giezendanner
Massachusetts Institute of Technology
D
D. S. Civitarese
IBM Research
Johannes Jakubik
Johannes Jakubik
Research Scientist @ IBM Research Europe
AI for Climate ImpactDeep learningMulti-modality
E
Eric Schmitt
Shell Information Technology International Inc.
A
Anirban Chandra
Shell Information Technology International Inc.
J
Jeremy Vila
Shell Information Technology International Inc.
D
Detlef Hohl
Shell Information Technology International Inc.
Chris Hill
Chris Hill
Massachusetts Institute of Technology
C
Campbell Watson
IBM Research
Sherrie Wang
Sherrie Wang
Assistant Professor, Earth Intelligence Lab, MIT
machine learningremote sensingagriculturesustainability