LD-RPS: Zero-Shot Unified Image Restoration via Latent Diffusion Recurrent Posterior Sampling

📅 2025-07-01
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Unified image restoration faces two major challenges: poor generalization and heavy reliance on paired training data—existing methods are either task-specific or require paired supervision, limiting their applicability to open-domain degradations. This paper proposes the first zero-shot, task-agnostic unified restoration framework, eliminating the need for paired data by leveraging a pre-trained latent diffusion model (LDM). Our approach innovatively integrates cyclic posterior sampling, multimodal semantic prior guidance, and a lightweight input alignment module. By modeling degradation-invariant semantic priors directly in the latent space, the method achieves robust restoration under unseen degradations. Extensive experiments demonstrate state-of-the-art performance across diverse tasks—including deblurring, denoising, super-resolution, and deraining—while significantly improving cross-degradation generalization. The framework enables practical zero-shot deployment in real-world scenarios without task-specific fine-tuning or ground-truth supervision.

Technology Category

Application Category

📝 Abstract
Unified image restoration is a significantly challenging task in low-level vision. Existing methods either make tailored designs for specific tasks, limiting their generalizability across various types of degradation, or rely on training with paired datasets, thereby suffering from closed-set constraints. To address these issues, we propose a novel, dataset-free, and unified approach through recurrent posterior sampling utilizing a pretrained latent diffusion model. Our method incorporates the multimodal understanding model to provide sematic priors for the generative model under a task-blind condition. Furthermore, it utilizes a lightweight module to align the degraded input with the generated preference of the diffusion model, and employs recurrent refinement for posterior sampling. Extensive experiments demonstrate that our method outperforms state-of-the-art methods, validating its effectiveness and robustness. Our code and data will be available at https://github.com/AMAP-ML/LD-RPS.
Problem

Research questions and friction points this paper is trying to address.

Unified image restoration across diverse degradations
Eliminating need for task-specific designs or paired datasets
Leveraging latent diffusion for zero-shot semantic priors
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses pretrained latent diffusion model
Incorporates multimodal understanding model
Employs lightweight module for alignment
🔎 Similar Papers
No similar papers found.
Huaqiu Li
Huaqiu Li
Tsinghua University
computer visionmachine learning
Y
Yong Wang
AMAP, Alibaba Group
T
Tongwen Huang
AMAP, Alibaba Group
H
Hailang Huang
AMAP, Alibaba Group
H
Haoqian Wang
Tsinghua University
X
Xiangxiang Chu
AMAP, Alibaba Group