🤖 AI Summary
This work addresses the insufficient inductive bias of isotropic Gaussian noise in diffusion probabilistic models (DPMs), which hinders effective modeling of topologically structured data distributions. To this end, we propose a frequency-aware noise control mechanism. Methodologically, we first embed frequency-domain analysis into the diffusion process design: introducing a frequency-domain noise operator, constructing a Fourier-space forward noising schedule, and developing a spectrum-adaptive inversion sampling strategy to enable explicit modeling and controllable suppression of critical frequency bands. Our core contribution lies in breaking the conventional Gaussian noise assumption, thereby enabling interpretable and tunable inductive bias injection in the frequency domain. Experiments on diverse image generation and severe degradation restoration tasks demonstrate significant improvements—average FID reduction of 12.3% and average LPIPS reduction of 18.7%—indicating enhanced fidelity and structural consistency of generated samples.
📝 Abstract
Diffusion Probabilistic Models (DPMs) are powerful generative models that have achieved unparalleled success in a number of generative tasks. In this work, we aim to build inductive biases into the training and sampling of diffusion models to better accommodate the target distribution of the data to model. For topologically structured data, we devise a frequency-based noising operator to purposefully manipulate, and set, these inductive biases. We first show that appropriate manipulations of the noising forward process can lead DPMs to focus on particular aspects of the distribution to learn. We show that different datasets necessitate different inductive biases, and that appropriate frequency-based noise control induces increased generative performance compared to standard diffusion. Finally, we demonstrate the possibility of ignoring information at particular frequencies while learning. We show this in an image corruption and recovery task, where we train a DPM to recover the original target distribution after severe noise corruption.