Bidirectional Reward-Guided Diffusion for Real-World Image Super-Resolution

📅 2026-02-05
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the performance degradation of diffusion-based super-resolution methods on real-world low-resolution images caused by distribution shifts. To this end, the authors propose Bird-SR, a novel framework that employs a bidirectional reward-guided diffusion mechanism to prioritize structural fidelity in early stages and enhance perceptual quality in later stages. Bird-SR innovatively integrates trajectory-level preference optimization with a dynamic fidelity-perception weighting strategy to mitigate reward hacking. By incorporating reward feedback learning, relative advantage space rewards, semantic alignment constraints, and a dynamic weighting mechanism, Bird-SR achieves state-of-the-art performance across multiple real-world super-resolution benchmarks, significantly outperforming existing approaches while simultaneously preserving structural consistency and improving perceptual quality.

Technology Category

Application Category

📝 Abstract
Diffusion-based super-resolution can synthesize rich details, but models trained on synthetic paired data often fail on real-world LR images due to distribution shifts. We propose Bird-SR, a bidirectional reward-guided diffusion framework that formulates super-resolution as trajectory-level preference optimization via reward feedback learning (ReFL), jointly leveraging synthetic LR-HR pairs and real-world LR images. For structural fidelity easily affected in ReFL, the model is directly optimized on synthetic pairs at early diffusion steps, which also facilitates structure preservation for real-world inputs under smaller distribution gap in structure levels. For perceptual enhancement, quality-guided rewards are applied at later sampling steps to both synthetic and real LR images. To mitigate reward hacking, the rewards for synthetic results are formulated in a relative advantage space bounded by their clean counterparts, while real-world optimization is regularized via a semantic alignment constraint. Furthermore, to balance structural and perceptual learning, we adopt a dynamic fidelity-perception weighting strategy that emphasizes structure preservation at early stages and progressively shifts focus toward perceptual optimization at later diffusion steps. Extensive experiments on real-world SR benchmarks demonstrate that Bird-SR consistently outperforms state-of-the-art methods in perceptual quality while preserving structural consistency, validating its effectiveness for real-world super-resolution.
Problem

Research questions and friction points this paper is trying to address.

super-resolution
real-world image
distribution shift
diffusion model
perceptual quality
Innovation

Methods, ideas, or system contributions that make the work stand out.

reward-guided diffusion
trajectory-level preference optimization
real-world super-resolution
semantic alignment
dynamic fidelity-perception weighting
🔎 Similar Papers
No similar papers found.
Z
Zihao Fan
University of Science and Technology of China, HeFei, China
Xin Lu
Xin Lu
University of Science and Technology of China
Computer VisionMachine LearningIntelligent Vehicle
Yidi Liu
Yidi Liu
University of Science and Technology of China
computer vision
Jie Huang
Jie Huang
University of Science and Technology of China
Image processingAIGCVLM
Dong Li
Dong Li
University of Science and Technology of China
X
Xueyang Fu
University of Science and Technology of China, HeFei, China
Z
Zheng-Jun Zha
University of Science and Technology of China, HeFei, China