The critical slowing down in diffusion models

📅 2026-05-12
📈 Citations: 0
Influential: 0
📄 PDF

career value

229K/year
🤖 AI Summary
Diffusion models suffer from critical slowing down near phase transitions, leading to inefficient sampling and training. This work establishes, for the first time, a controllable theoretical framework within the Gaussian limit of statistical field theory to elucidate the mechanism of critical slowing down in parameter learning and proposes a solution that integrates deep architectures with physical locality. By combining single- and two-layer neural networks with local score approximations, the approach is theoretically analyzed and experimentally validated in the O(n) model. Results demonstrate that the two-layer architecture reduces training complexity from quadratic to logarithmic in system size—without increasing the number of parameters—thereby substantially enhancing sampling efficiency in critical regions.
📝 Abstract
Computational sampling has been central to the sciences since the mid-20th century. While machine-learning-based approaches have recently enabled major advances, their behavior remains poorly understood, with limited theoretical control over when and why they succeed. Here we provide such insight for diffusion models-a class of generative schemes highly effective in practice-by analyzing their application to the $O(n)$ model of statistical field theory in the Gaussian limit $n \to \infty$. In this analytically tractable setting, we show that training a score model with a one-layer network architecture matching the exact solution exhibits a form of critical slowing down in parameter learning. This slowing down also impacts the generation process, indicating that the well-known difficulties of sampling near criticality persist even for learned generative models. To overcome this bottleneck, we demonstrate the power of combining architectural depth with physical locality. We find that using a two-layer architecture drastically reduces the critical slowing down, with the training time scaling logarithmically rather than quadratically with system size. By introducing a local score approximation we show that this acceleration in training time can be achieved without increasing the number of neural network parameters. Taken together, these results demonstrate that diffusion models can overcome the critical slowing down through appropriate architectural design, and establish a controlled framework for understanding and improving learned sampling methods in statistical physics and beyond.
Problem

Research questions and friction points this paper is trying to address.

critical slowing down
diffusion models
sampling
statistical physics
generative models
Innovation

Methods, ideas, or system contributions that make the work stand out.

critical slowing down
diffusion models
score-based generative modeling
neural architecture design
local score approximation