🤖 AI Summary
Prior theoretical analyses of diffusion model training suffer from either strong assumptions—such as exact access to the empirical risk minimizer (ERM)—or exponential dependence on dimensionality, rendering them inapplicable to high-dimensional settings.
Method: We develop the first rigorous sample complexity framework for score estimation *without* requiring ERM access. Our approach leverages a structured error decomposition (statistical, approximation, and optimization components), novel non-convex score matching theory, and tight generalization error bounds. Crucially, we decouple neural network parameterization from dimensional scaling.
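Schematically, writing $\hat{s}_\theta$ for the trained score network (our illustrative notation, not necessarily the paper's), the decomposition bounds the score estimation error as

$$
\mathcal{E}(\hat{s}_\theta) \;\le\; \underbrace{\mathcal{E}_{\mathrm{stat}}(n)}_{\text{finite samples}} \;+\; \underbrace{\mathcal{E}_{\mathrm{approx}}(\mathcal{F})}_{\text{network expressivity}} \;+\; \underbrace{\mathcal{E}_{\mathrm{opt}}}_{\text{training error}},
$$

so each term can be controlled separately: generalization bounds for the statistical term, approximation theory for the network class $\mathcal{F}$, and non-convex optimization guarantees for the training error, with no step requiring an exact ERM.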
Contribution/Results: We establish a tight $\widetilde{\mathcal{O}}(\epsilon^{-6})$ sample complexity upper bound for score estimation, eliminating the exponential dependence on dimension that plagues prior work. This yields the first verifiable, assumption-light, and dimensionally favorable theoretical guarantee for high-dimensional diffusion model training, significantly advancing the theoretical foundations of generative modeling.
📝 Abstract
Diffusion models have demonstrated state-of-the-art performance across vision, language, and scientific domains. Despite this empirical success, prior theoretical analyses of their sample complexity either scale poorly with the input data dimension or rely on unrealistic assumptions such as access to exact empirical risk minimizers. In this work, we provide a principled analysis of score estimation, establishing a sample complexity bound of $\widetilde{\mathcal{O}}(\epsilon^{-6})$. Our approach leverages a structured decomposition of the score estimation error into statistical, approximation, and optimization errors, enabling us to eliminate the exponential dependence on neural network parameters that arises in prior analyses. This is the first such result to achieve sample complexity bounds without assuming access to the empirical risk minimizer of the score estimation loss.
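For concreteness, here is a minimal sketch of the denoising score matching objective whose estimation error the paper analyzes; the architecture, noise schedule, and toy data below are our illustrative assumptions, not the paper's construction.

```python
# Minimal denoising score matching (DSM) sketch. All names and
# hyperparameters are illustrative, not taken from the paper.
import torch
import torch.nn as nn

dim = 2
score_net = nn.Sequential(  # small MLP standing in for the analyzed network class
    nn.Linear(dim + 1, 64), nn.SiLU(),
    nn.Linear(64, 64), nn.SiLU(),
    nn.Linear(64, dim),
)
opt = torch.optim.Adam(score_net.parameters(), lr=1e-3)

def dsm_loss(x0: torch.Tensor) -> torch.Tensor:
    """DSM objective: regress onto the score of the Gaussian perturbation kernel.

    For x_t = x_0 + sigma_t * z with z ~ N(0, I), the conditional score is
    -z / sigma_t, so minimizing this loss estimates the true score in L2.
    """
    t = torch.rand(x0.shape[0], 1)       # random noise levels in (0, 1)
    sigma = 0.01 + t * (1.0 - 0.01)      # illustrative linear sigma schedule
    z = torch.randn_like(x0)
    x_t = x0 + sigma * z
    pred = score_net(torch.cat([x_t, t], dim=1))
    target = -z / sigma
    # sigma^2 weighting keeps regression targets bounded across noise levels
    return ((sigma * (pred - target)) ** 2).sum(dim=1).mean()

# Toy training loop on synthetic two-mode data (stand-in for real samples).
for step in range(1000):
    shift = torch.sign(torch.randn(128, 1)) * 2.0   # two well-separated modes
    x0 = torch.randn(128, dim) + shift
    loss = dsm_loss(x0)
    opt.zero_grad()
    loss.backward()
    opt.step()
```

The paper's question, in these terms, is how many samples `x0` are needed for a network trained this way (by an ordinary first-order method, with no exact ERM oracle) to estimate the score to accuracy $\epsilon$.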