Multi-Scale Target-Aware Representation Learning for Fundus Image Enhancement

📅 2025-05-03
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Low-quality fundus images suffer from multi-scale information loss and insufficient lesion region enhancement. Method: This paper proposes the first end-to-end lightweight enhancement framework, integrating wavelet-based multi-scale encoding, a structure-preserving hierarchical group attention decoder, and an unsupervised targeted lesion-aware aggregation mechanism—enabling concurrent anatomical fidelity preservation and pathological region enhancement without requiring lesion annotations. Contribution/Results: Extensive experiments demonstrate significant PSNR/SSIM improvements over state-of-the-art methods across multiple public datasets, with a 37% reduction in model parameters. Moreover, the framework exhibits strong generalizability: zero-shot transfer to other ophthalmic imaging tasks achieves competitive performance, confirming its robustness and broad applicability in clinical fundus image enhancement.

Technology Category

Application Category

📝 Abstract
High-quality fundus images provide essential anatomical information for clinical screening and ophthalmic disease diagnosis. Yet, due to hardware limitations, operational variability, and patient compliance, fundus images often suffer from low resolution and signal-to-noise ratio. Recent years have witnessed promising progress in fundus image enhancement. However, existing works usually focus on restoring structural details or global characteristics of fundus images, lacking a unified image enhancement framework to recover comprehensive multi-scale information. Moreover, few methods pinpoint the target of image enhancement, e.g., lesions, which is crucial for medical image-based diagnosis. To address these challenges, we propose a multi-scale target-aware representation learning framework (MTRL-FIE) for efficient fundus image enhancement. Specifically, we propose a multi-scale feature encoder (MFE) that employs wavelet decomposition to embed both low-frequency structural information and high-frequency details. Next, we design a structure-preserving hierarchical decoder (SHD) to fuse multi-scale feature embeddings for real fundus image restoration. SHD integrates hierarchical fusion and group attention mechanisms to achieve adaptive feature fusion while retaining local structural smoothness. Meanwhile, a target-aware feature aggregation (TFA) module is used to enhance pathological regions and reduce artifacts. Experimental results on multiple fundus image datasets demonstrate the effectiveness and generalizability of MTRL-FIE for fundus image enhancement. Compared to state-of-the-art methods, MTRL-FIE achieves superior enhancement performance with a more lightweight architecture. Furthermore, our approach generalizes to other ophthalmic image processing tasks without supervised fine-tuning, highlighting its potential for clinical applications.
Problem

Research questions and friction points this paper is trying to address.

Enhancing low-quality fundus images with multi-scale details
Unifying structural and global feature restoration in fundus images
Target-aware enhancement focusing on lesions for better diagnosis
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-scale feature encoder with wavelet decomposition
Structure-preserving hierarchical decoder with group attention
Target-aware feature aggregation for lesion enhancement
🔎 Similar Papers
No similar papers found.
H
Haofan Wu
Research Center for Translational Medicine, Medical Innovation Center and State Key Laboratory of Cardiology, Shanghai East Hospital, Shanghai, 200120, China; The Institute for Biomedical Engineering & Nano Science, Tongji University School of Medicine, Shanghai, 200120, China; Shanghai Research Institute for Intelligent Autonomous Systems, Shanghai, 201210, China
Yin Huang
Yin Huang
Research Assistant, University of Florida
Multi-Armed BanditsEdge ComputingWireless CommunicationsQuantum Networking
Y
Yuqing Wu
Research Center for Translational Medicine, Medical Innovation Center and State Key Laboratory of Cardiology, Shanghai East Hospital, Shanghai, 200120, China; The Institute for Biomedical Engineering & Nano Science, Tongji University School of Medicine, Shanghai, 200120, China
Q
Qiuyu Yang
Research Center for Translational Medicine, Medical Innovation Center and State Key Laboratory of Cardiology, Shanghai East Hospital, Shanghai, 200120, China; The Institute for Biomedical Engineering & Nano Science, Tongji University School of Medicine, Shanghai, 200120, China
B
Bingfang Wang
Research Center for Translational Medicine, Medical Innovation Center and State Key Laboratory of Cardiology, Shanghai East Hospital, Shanghai, 200120, China; The Institute for Biomedical Engineering & Nano Science, Tongji University School of Medicine, Shanghai, 200120, China
L
Li Zhang
Department of Ophthalmology, Tongji Hospital, School of Medicine, Tongji University, Shanghai, 200000, China
M
Muhammad Fahadullah Khan
Ophthalmology, Isra Postgraduate Institute of Ophthalmology, Karachi, Pakistan
A
Ali Zia
Ophthalmology, Isra Postgraduate Institute of Ophthalmology, Karachi, Pakistan; The Eye Centre, South City Hospital, Karachi, Pakistan
M
M. Memon
Ophthalmology, Isra Postgraduate Institute of Ophthalmology, Karachi, Pakistan
S
Syed Sohail Bukhari
Ophthalmology, Isra Postgraduate Institute of Ophthalmology, Karachi, Pakistan
A
Abdul Fattah Memon
Ophthalmology, Isra Postgraduate Institute of Ophthalmology, Karachi, Pakistan
D
Daizong Ji
Research Center for Translational Medicine, Medical Innovation Center and State Key Laboratory of Cardiology, Shanghai East Hospital, Shanghai, 200120, China; The Institute for Biomedical Engineering & Nano Science, Tongji University School of Medicine, Shanghai, 200120, China
Ya Zhang
Ya Zhang
Shanghai Jiao Tong University
Machine learningComputer visionMedical Imaging
G
Ghulam Mustafa
Center for the Development of Laboratory Equipment, Pakistan Council of Scientific and Industrial Research, Karachi, Pakistan
Yin Fang
Yin Fang
National Institutes of Health
AI4BioinformaticsKnowledge GraphLanguage Model