🤖 AI Summary
This work addresses the fundamental trade-off between discrete-time modeling and continuous-time stochastic differential equation (SDE) modeling in denoising diffusion probabilistic models (DDPMs) and score-based generative models (SGMs). Specifically, it tackles the challenge that discretization-induced errors propagate through reverse sampling, degrading sample quality. To this end, we first unify discrete and continuous modeling paradigms by deriving a total variation (TV) distance bound—integrating discrete Girsanov transformation, Pinsker’s inequality, and the data processing inequality. This bound rigorously characterizes the performance limits of both frameworks. We further obtain an analytically tractable TV upper bound that quantitatively exposes the coupled influence of step size, noise schedule, and score estimation error. Our theoretical results provide principled, information-theoretic guidance for designing efficient and robust discrete-time sampling algorithms in diffusion models.
📝 Abstract
This work explores the theoretical and practical foundations of denoising diffusion probabilistic models (DDPMs) and score-based generative models, which leverage stochastic processes and Brownian motion to model complex data distributions. These models employ forward and reverse diffusion processes defined through stochastic differential equations (SDEs) to iteratively add and remove noise, enabling high-quality data generation. By analyzing the performance bounds of these models, we demonstrate how score estimation errors propagate through the reverse process and bound the total variation distance using discrete Girsanov transformations, Pinsker's inequality, and the data processing inequality (DPI) for an information theoretic lens.