🤖 AI Summary
Global air quality forecasting is hindered by strong spatial heterogeneity and poor generalization of models to unseen regions. To address this, this work proposes OmniAir, a novel framework that introduces an inductive semantic topology learning mechanism tailored for global station-level prediction. OmniAir generates generalizable station identities by encoding invariant physical-environmental attributes and dynamically constructs an adaptive sparse graph to capture non-Euclidean long-range dependencies and physical dispersion patterns. Evaluated on the WorldAir dataset encompassing over 7,800 monitoring stations, OmniAir significantly outperforms 18 baseline models while achieving nearly a tenfold speedup in inference, effectively mitigating monitoring blind spots in data-sparse regions.
📝 Abstract
Global air quality forecasting grapples with extreme spatial heterogeneity and the poor generalization of existing transductive models to unseen regions. To tackle this, we propose OmniAir, a semantic topology learning framework tailored for global station-level prediction. By encoding invariant physical environmental attributes into generalizable station identities and dynamically constructing adaptive sparse topologies, our approach effectively captures long-range non-Euclidean correlations and physical diffusion patterns across unevenly distributed global networks. We further curate WorldAir, a massive dataset covering over 7,800 stations worldwide. Extensive experiments show that OmniAir achieves state-of-the-art performance against 18 baselines, maintaining high efficiency and scalability with speeds nearly 10 times faster than existing models, while effectively bridging the monitoring gap in data-sparse regions.