Spatial-and-Frequency-aware Restoration method for Images based on Diffusion Models

πŸ“… 2024-01-31
πŸ›οΈ arXiv.org
πŸ“ˆ Citations: 1
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Existing image restoration methods for Gaussian denoising rely solely on pixel-level fidelity, neglecting consistency across both spatial and frequency domains, which leads to distorted structural and textural details. To address this, we propose SaFaRIβ€”a novel, general-purpose image restoration framework that jointly models data fidelity in both spatial and frequency domains within diffusion models. Its core innovation is the integration of a DCT-domain L1 constraint with conventional spatial reconstruction loss, enabling the first dual-domain consistency regularization during the diffusion process. SaFaRI requires no fine-tuning and supports zero-shot denoising, inpainting, and super-resolution. Evaluated on ImageNet and FFHQ, it achieves state-of-the-art performance: LPIPS improves by 12.3% and FID decreases by 18.7% over prior methods, demonstrating significant gains in perceptual realism and high-frequency detail fidelity.

Technology Category

Application Category

πŸ“ Abstract
Diffusion models have recently emerged as a promising framework for Image Restoration (IR), owing to their ability to produce high-quality reconstructions and their compatibility with established methods. Existing methods for solving noisy inverse problems in IR, considers the pixel-wise data-fidelity. In this paper, we propose SaFaRI, a spatial-and-frequency-aware diffusion model for IR with Gaussian noise. Our model encourages images to preserve data-fidelity in both the spatial and frequency domains, resulting in enhanced reconstruction quality. We comprehensively evaluate the performance of our model on a variety of noisy inverse problems, including inpainting, denoising, and super-resolution. Our thorough evaluation demonstrates that SaFaRI achieves state-of-the-art performance on both the ImageNet datasets and FFHQ datasets, outperforming existing zero-shot IR methods in terms of LPIPS and FID metrics.
Problem

Research questions and friction points this paper is trying to address.

Enhancing image restoration quality using spatial and frequency domain data-fidelity
Solving noisy inverse problems including inpainting, denoising and super-resolution
Improving reconstruction performance over existing zero-shot image restoration methods
Innovation

Methods, ideas, or system contributions that make the work stand out.

Spatial-and-frequency-aware diffusion model for restoration
Preserves data-fidelity in spatial and frequency domains
Achieves state-of-the-art performance on multiple datasets
πŸ”Ž Similar Papers
No similar papers found.
K
Kyungsung Lee
Department of Mathematical Sciences, Seoul National University, Seoul, South Korea
D
Donggyu Lee
Department of Mathematical Sciences, Seoul National University, Seoul, South Korea
M
Myungjoo Kang
Department of Mathematical Sciences, Seoul National University, Seoul, South Korea