🤖 AI Summary
This work addresses the challenge of accurately approximating the gradient of a diffusion model's output distribution. We propose mean-shift distillation (MSD), a diffusion distillation technique that rigorously incorporates mean-shift theory into the distillation framework. MSD requires neither model retraining nor modification of the sampling procedure; instead, it applies mean-shift mode seeking directly to the output distribution, so that the extrema of the resulting gradient proxy align with the true data modes. To estimate this gradient efficiently while preserving mode alignment and convergence stability, we introduce a product-distribution sampling strategy. Used as a drop-in replacement for score distillation sampling (SDS) with Stable Diffusion, MSD improves mode alignment and convergence speed in text-to-image and text-to-3D generation, yielding higher-fidelity outputs.
📝 Abstract
We present mean-shift distillation, a novel diffusion distillation technique that provides a provably good proxy for the gradient of the diffusion output distribution. The proxy is derived directly from mean-shift mode seeking on the distribution, and we show that its extrema are aligned with the distribution's modes. We further derive an efficient product-distribution sampling procedure to evaluate the gradient. Our method is formulated as a drop-in replacement for score distillation sampling (SDS), requiring neither model retraining nor extensive modification of the sampling procedure. We show that it exhibits superior mode alignment as well as improved convergence in both synthetic and practical setups, yielding higher-fidelity results when applied to both text-to-image and text-to-3D applications with Stable Diffusion.
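To make the core idea concrete, below is a minimal NumPy sketch of classical mean-shift mode seeking, the procedure the abstract builds on: the mean-shift vector computed from kernel-weighted samples points toward a mode of the underlying density, so iterating it converges to that mode. This is an illustrative toy on a 2D Gaussian mixture, not the paper's distillation implementation; the bandwidth, sample counts, and function names here are assumptions for the example.

```python
import numpy as np

def mean_shift_step(x, samples, bandwidth):
    """One mean-shift update: Gaussian-kernel-weighted mean of samples, minus x.

    The returned vector is proportional to the gradient of the
    kernel density estimate's log-density at x, so following it
    performs gradient ascent toward a mode.
    """
    d = samples - x
    w = np.exp(-np.sum(d**2, axis=1) / (2.0 * bandwidth**2))
    weighted_mean = (w[:, None] * samples).sum(axis=0) / w.sum()
    return weighted_mean - x

# Toy density: a two-mode Gaussian mixture in 2D.
rng = np.random.default_rng(0)
samples = np.concatenate([
    rng.normal([-2.0, 0.0], 0.3, size=(200, 2)),
    rng.normal([ 2.0, 0.0], 0.3, size=(200, 2)),
])

# Iterating the mean-shift vector from a nearby start
# converges to the mode around (2, 0).
x = np.array([1.0, 0.5])
for _ in range(50):
    x = x + mean_shift_step(x, samples, bandwidth=0.5)
```

The paper's contribution is to turn this mode-seeking vector into a distillation gradient on the diffusion output distribution, evaluated via product-distribution sampling rather than the explicit sample sum used in this toy.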