DACESR: Degradation-Aware Conditional Embedding for Real-World Image Super-Resolution

📅 2026-02-27
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the limited ability of existing language-conditioned multimodal large models to recognize and restore degraded images in real-world image super-resolution. To overcome this, the authors propose a degradation-aware strategy that introduces a Realistic Embedding Extractor (REE) to enhance degradation content recognition and integrates a Conditional Feature Modulator (CFM) to inject high-level semantic information into a Mamba-based network for high-quality texture recovery. The method uniquely combines the Recognize Anything Model (RAM), contrastive learning, and degradation-aware embeddings, marking the first effort to integrate the Mamba architecture with conditional semantic guidance for real-world image super-resolution. Experiments demonstrate that the proposed approach achieves a superior balance between fidelity and perceptual quality, significantly improving visual reconstruction and validating the potential of Mamba in real-world super-resolution tasks.

Technology Category

Application Category

📝 Abstract
Multimodal large models have shown excellent ability in addressing image super-resolution in real-world scenarios by leveraging language class as condition information, yet their abilities in degraded images remain limited. In this paper, we first revisit the capabilities of the Recognize Anything Model (RAM) for degraded images by calculating text similarity. We find that directly using contrastive learning to fine-tune RAM in the degraded space is difficult to achieve acceptable results. To address this issue, we employ a degradation selection strategy to propose a Real Embedding Extractor (REE), which achieves significant recognition performance gain on degraded image content through contrastive learning. Furthermore, we use a Conditional Feature Modulator (CFM) to incorporate the high-level information of REE for a powerful Mamba-based network, which can leverage effective pixel information to restore image textures and produce visually pleasing results. Extensive experiments demonstrate that the REE can effectively help image super-resolution networks balance fidelity and perceptual quality, highlighting the great potential of Mamba in real-world applications. The source code of this work will be made publicly available at: https://github.com/nathan66666/DACESR.git
Problem

Research questions and friction points this paper is trying to address.

image super-resolution
degraded images
real-world scenarios
multimodal models
condition information
Innovation

Methods, ideas, or system contributions that make the work stand out.

Degradation-Aware
Conditional Embedding
Real-World Super-Resolution
Mamba-based Network
Contrastive Learning
🔎 Similar Papers
No similar papers found.
X
Xiaoyan Lei
School of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, Henan, China
Wenlong Zhang
Wenlong Zhang
Shanghai Artificial Intelligence Laboratory
Machine LearningAI4ScienceAutonomous Discovery
B
Biao Luo
School of Automation, Central South University, Changsha 410083, China
H
Hui Liang
School of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, Henan, China
W
Weifeng Cao
School of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, Henan, China
Q
Qiuting Lin
Intelligent Manufacturing Engineering at Machinery Technology Development Co., Ltd., China Academy of Machinery Science and Technology Group Co., Ltd., Beijing 100801, China