Deep classifier kriging for probabilistic spatial prediction of air quality index

πŸ“… 2025-12-29
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Traditional kriging methods suffer from modeling inaccuracies when applied to non-Gaussian and highly nonlinear air quality index (AQI) spatial fields. To address this, we propose the Deep Classifier Kriging (DCK) frameworkβ€”a novel probabilistic interpolation paradigm that unifies classifier-driven distribution estimation, spatial covariance learning, and multi-source data fusion (ground station observations and numerical model outputs). DCK explicitly relaxes the linearity and Gaussianity assumptions inherent in classical kriging, enabling non-collocated bivariate modeling and yielding high-accuracy, well-calibrated full predictive distributions. Evaluated on both synthetic and real-world AQI datasets, DCK significantly improves point prediction accuracy and uncertainty quantification quality compared to conventional approaches. It produces spatially resolved predictive distributions, facilitating practical applications such as exceedance probability mapping and risk assessment of extreme pollution events.

Technology Category

Application Category

πŸ“ Abstract
Accurate spatial interpolation of the air quality index (AQI), computed from concentrations of multiple air pollutants, is essential for regulatory decision-making, yet AQI fields are inherently non-Gaussian and often exhibit complex nonlinear spatial structure. Classical spatial prediction methods such as kriging are linear and rely on Gaussian assumptions, which limits their ability to capture these features and to provide reliable predictive distributions. In this study, we propose extit{deep classifier kriging} (DCK), a flexible, distribution-free deep learning framework for estimating full predictive distribution functions for univariate and bivariate spatial processes, together with a extit{data fusion} mechanism that enables modeling of non-collocated bivariate processes and integration of heterogeneous air pollution data sources. Through extensive simulation experiments, we show that DCK consistently outperforms conventional approaches in predictive accuracy and uncertainty quantification. We further apply DCK to probabilistic spatial prediction of AQI by fusing sparse but high-quality station observations with spatially continuous yet biased auxiliary model outputs, yielding spatially resolved predictive distributions that support downstream tasks such as exceedance and extreme-event probability estimation for regulatory risk assessment and policy formulation.
Problem

Research questions and friction points this paper is trying to address.

Develops deep classifier kriging for non-Gaussian spatial prediction
Enables data fusion of heterogeneous air pollution sources
Provides probabilistic AQI distributions for regulatory risk assessment
Innovation

Methods, ideas, or system contributions that make the work stand out.

Deep classifier kriging for non-Gaussian spatial prediction
Data fusion integrates heterogeneous air pollution sources
Distribution-free deep learning framework for predictive distributions
πŸ”Ž Similar Papers
No similar papers found.
J
Junyu Chen
Department of Statistics, The George Washington University, Washington DC, USA
Pratik Nag
Pratik Nag
Research Fellow, NIASRA
Spatio-temporal statisticsArtificial Intelligence
H
Huixia Judy-Wang
Department of Statistics, Rice University, Houston, USA
Y
Ying Sun
CEMSE Division, Statistics Program, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia