🤖 AI Summary
This work investigates the high-accuracy approximation of target conditional distributions using conditional diffusion models. By decomposing the output error over path space into terminal mismatch and the sum of per-step reverse kernel errors, each step is recast as a static conditional density approximation problem. Employing finite Gaussian mixture reverse kernels with ReLU network logits and assuming exact terminal matching, the study establishes, for the first time, the denseness of such neural reverse kernels in the sense of conditional KL divergence, thereby providing a universal approximation theory for conditional diffusion models. Leveraging Norets’ Gaussian mixture approximation, quantitative bounds for ReLU networks, and the error decomposition framework, the approach can approximate regular target distributions arbitrarily well under the context-averaged conditional KL divergence, with terminal mismatch vanishing as the number of diffusion steps increases.
📝 Abstract
We prove that conditional diffusion models whose reverse kernels are finite Gaussian mixtures with ReLU-network logits can approximate suitably regular target distributions arbitrarily well in context-averaged conditional KL divergence, up to an irreducible terminal mismatch that typically vanishes with increasing diffusion horizon. A path-space decomposition reduces the output error to this mismatch plus per-step reverse-kernel errors; assuming each reverse kernel factors through a finite-dimensional feature map, each step becomes a static conditional density approximation problem, solved by composing Norets' Gaussian-mixture theory with quantitative ReLU bounds. Under exact terminal matching the resulting neural reverse-kernel class is dense in conditional KL.