🤖 AI Summary
Classical data assimilation methods struggle to accurately approximate the true posterior in highly nonlinear systems. To address this, we propose the State-Observation Augmented Diffusion (SOAD) model, which jointly models the generative process of states and observations. Crucially, we provide the first rigorous proof that SOAD’s marginal posterior exactly equals the true posterior—overcoming a fundamental theoretical limitation of conventional score-matching approaches in nonlinear settings. SOAD unifies physical state-space modeling and observation operator embedding within a diffusion probabilistic framework, enabling deep integration of physics-based constraints and data-driven learning via joint training of denoising score estimation and variational inference. Evaluated on multiple strongly nonlinear assimilation benchmarks, SOAD consistently outperforms existing data-driven methods, achieving superior posterior estimation accuracy and robustness.
📝 Abstract
Data assimilation has become a key technique for combining physical models with observational data to estimate state variables. However, classical assimilation algorithms often struggle with the high nonlinearity present in both physical and observational models. To address this challenge, a novel generative model, termed the State-Observation Augmented Diffusion (SOAD) model is proposed for data-driven assimilation. The marginal posterior associated with SOAD has been derived and then proved to match the true posterior distribution under mild assumptions, suggesting its theoretical advantages over previous score-based approaches. Experimental results also indicate that SOAD may offer improved performance compared to existing data-driven methods.