InstaRevive: One-Step Image Enhancement via Dynamic Score Matching

📅 2025-04-22
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing diffusion-based image enhancement methods rely on multi-step iterative sampling, incurring substantial computational overhead. To address this, we propose InstaRevive—a single-step, high-fidelity image enhancement framework grounded in dynamic score matching. Its core innovation lies in the first integration of dynamic diffusion range control with cross-modal text guidance—derived from image captioning—to precisely model denoising trajectories. By jointly leveraging score-matching distillation, gradient-aligned distribution matching, and text-conditioned injection, InstaRevive reproduces the generation quality of multi-step diffusion within a single forward pass. Evaluated on low-light, haze, and underwater enhancement benchmarks, InstaRevive achieves state-of-the-art visual quality at significantly reduced inference cost, effectively balancing efficiency and fidelity.

Technology Category

Application Category

📝 Abstract
Image enhancement finds wide-ranging applications in real-world scenarios due to complex environments and the inherent limitations of imaging devices. Recent diffusion-based methods yield promising outcomes but necessitate prolonged and computationally intensive iterative sampling. In response, we propose InstaRevive, a straightforward yet powerful image enhancement framework that employs score-based diffusion distillation to harness potent generative capability and minimize the sampling steps. To fully exploit the potential of the pre-trained diffusion model, we devise a practical and effective diffusion distillation pipeline using dynamic control to address inaccuracies in updating direction during score matching. Our control strategy enables a dynamic diffusing scope, facilitating precise learning of denoising trajectories within the diffusion model and ensuring accurate distribution matching gradients during training. Additionally, to enrich guidance for the generative power, we incorporate textual prompts via image captioning as auxiliary conditions, fostering further exploration of the diffusion model. Extensive experiments substantiate the efficacy of our framework across a diverse array of challenging tasks and datasets, unveiling the compelling efficacy and efficiency of InstaRevive in delivering high-quality and visually appealing results. Code is available at https://github.com/EternalEvan/InstaRevive.
Problem

Research questions and friction points this paper is trying to address.

Reduces iterative sampling steps in diffusion-based image enhancement
Improves accuracy in score matching via dynamic control strategy
Enhances generative guidance using textual prompts from image captions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Score-based diffusion distillation for fast enhancement
Dynamic control for accurate score matching
Textual prompts via image captioning as conditions
🔎 Similar Papers
No similar papers found.
Yixuan Zhu
Yixuan Zhu
Tsinghua University
Computer Vision
Haolin Wang
Haolin Wang
Ph.D. Student. Georgia Institute of Technology
infrastructure monitoringasset managementAIMLcomputer vision
A
Ao Li
Tsinghua Shenzhen International Graduate School, Tsinghua University
Wenliang Zhao
Wenliang Zhao
Tsinghua University
Computer VisionGenerative Models
Y
Yansong Tang
Tsinghua Shenzhen International Graduate School, Tsinghua University
J
Jingxuan Niu
Tsinghua Shenzhen International Graduate School, Tsinghua University
L
Lei Chen
Department of Automation, Tsinghua University
J
Jie Zhou
Department of Automation, Tsinghua University
J
Jiwen Lu
Department of Automation, Tsinghua University