RestoreVAR: Visual Autoregressive Generation for All-in-One Image Restoration

📅 2025-05-23

📈 Citations: 0

✨ Influential: 0

career value

210K/year

🤖 AI Summary

To address the slow inference and poor real-time deployability of latent diffusion models (LDMs) in all-in-one image restoration (AiOR), this paper pioneers the integration of vision autoregressive (VAR) modeling into AiOR, proposing an efficient generative framework. Methodologically, it replaces iterative denoising with single-step generation. Key contributions include: (1) a cross-scale cross-attention mechanism that enhances multi-granularity feature interaction; and (2) a latent-space refinement module that improves reconstruction fidelity and generalization. Evaluated on multiple AiOR benchmarks, the method achieves state-of-the-art performance, accelerating inference by over 10× compared to LDMs while preserving strong generalization across diverse degradation types. This work significantly advances the practical deployment of generative image restoration.

Technology Category

Application Category

📝 Abstract

The use of latent diffusion models (LDMs) such as Stable Diffusion has significantly improved the perceptual quality of All-in-One image Restoration (AiOR) methods, while also enhancing their generalization capabilities. However, these LDM-based frameworks suffer from slow inference due to their iterative denoising process, rendering them impractical for time-sensitive applications. To address this, we propose RestoreVAR, a novel generative approach for AiOR that significantly outperforms LDM-based models in restoration performance while achieving over $mathbf{10 imes}$ faster inference. RestoreVAR leverages visual autoregressive modeling (VAR), a recently introduced approach which performs scale-space autoregression for image generation. VAR achieves comparable performance to that of state-of-the-art diffusion transformers with drastically reduced computational costs. To optimally exploit these advantages of VAR for AiOR, we propose architectural modifications and improvements, including intricately designed cross-attention mechanisms and a latent-space refinement module, tailored for the AiOR task. Extensive experiments show that RestoreVAR achieves state-of-the-art performance among generative AiOR methods, while also exhibiting strong generalization capabilities.

Problem

Research questions and friction points this paper is trying to address.

Improving slow inference in LDM-based All-in-One image Restoration

Enhancing restoration performance with visual autoregressive modeling

Reducing computational costs while maintaining high-quality image generation

Innovation

Methods, ideas, or system contributions that make the work stand out.

Visual autoregressive modeling for image restoration

Cross-attention mechanisms for enhanced performance

Latent-space refinement module for AiOR tasks

🔎 Similar Papers

No similar papers found.