Differentiable Solver Search for Fast Diffusion Sampling

📅 2025-05-27
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Diffusion model sampling requires numerous function evaluations, incurring substantial computational overhead; existing ODE-based solvers predominantly rely on time-$t$-dependent Lagrange interpolation, which has been shown to be suboptimal. This paper proposes a differentiable solver search framework that jointly optimizes ODE solver architecture end-to-end within a compact spatio-temporal differentiable space, thereby uncovering and circumventing the inherent limitations of $t$-dependent interpolation for the first time. Our method integrates differentiable architecture search, ODE parameterization modeling, and gradient-driven coefficient optimization, enabling plug-and-play deployment across diverse diffusion models—including Rectified Flow and DDPM. On ImageNet256, our solvers achieve FID scores of 2.40, 2.35, and 2.33 using only 10 sampling steps for SiT-XL/2, FlowDCN-XL/2, and DiT-XL/2, respectively—significantly outperforming conventional solvers. Moreover, the approach demonstrates strong generalization across model architectures, image resolutions, and model scales.

Technology Category

Application Category

📝 Abstract
Diffusion models have demonstrated remarkable generation quality but at the cost of numerous function evaluations. Recently, advanced ODE-based solvers have been developed to mitigate the substantial computational demands of reverse-diffusion solving under limited sampling steps. However, these solvers, heavily inspired by Adams-like multistep methods, rely solely on t-related Lagrange interpolation. We show that t-related Lagrange interpolation is suboptimal for diffusion model and reveal a compact search space comprised of time steps and solver coefficients. Building on our analysis, we propose a novel differentiable solver search algorithm to identify more optimal solver. Equipped with the searched solver, rectified-flow models, e.g., SiT-XL/2 and FlowDCN-XL/2, achieve FID scores of 2.40 and 2.35, respectively, on ImageNet256 with only 10 steps. Meanwhile, DDPM model, DiT-XL/2, reaches a FID score of 2.33 with only 10 steps. Notably, our searched solver outperforms traditional solvers by a significant margin. Moreover, our searched solver demonstrates generality across various model architectures, resolutions, and model sizes.
Problem

Research questions and friction points this paper is trying to address.

Optimizing ODE-based solvers for faster diffusion sampling
Addressing suboptimal t-related Lagrange interpolation in solvers
Improving computational efficiency without sacrificing generation quality
Innovation

Methods, ideas, or system contributions that make the work stand out.

Differentiable solver search algorithm
Compact search space optimization
Generality across model architectures
🔎 Similar Papers
No similar papers found.
S
Shuai Wang
State Key Lab of Novel Software Technology, Nanjing University, Nanjing, China
Zexian Li
Zexian Li
Alibaba
AIGCInference Acceleration
Q
Qipeng Zhang
Taobao & Tmall Group of Alibaba, Hangzhou, China
Tianhui Song
Tianhui Song
Nanjing University
computer vision
X
Xubin Li
Taobao & Tmall Group of Alibaba, Hangzhou, China
Tiezheng Ge
Tiezheng Ge
Senior staff algorithm engineer, Alimama, Alibaba Group
Computer VisionAIGCRecommender Systems
B
Bo Zheng
Taobao & Tmall Group of Alibaba, Hangzhou, China
L
Limin Wang
State Key Lab of Novel Software Technology, Nanjing University, Nanjing, China; Shanghai AI Lab, Shanghai, China