WaveNet-SF: A Hybrid Network for Retinal Disease Detection Based on Wavelet Transform in the Spatial-Frequency Domain

πŸ“… 2025-01-21
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Early diagnosis of retinal diseases from OCT images remains challenging due to severe speckle noise, complex lesion morphology, and highly variable lesion scales. To address these issues, this paper proposes a spatial-frequency joint learning framework. Its core contributions are: (1) a novel multi-scale wavelet spatial-attention (MSW-SA) module that enhances lesion region localization; and (2) a high-frequency feature compensation (HFFC) block that effectively restores edge details while suppressing speckle noise. By integrating wavelet transform, multi-scale feature modeling, and dual-domain (spatial and frequency) attention mechanisms, the method achieves state-of-the-art classification accuracy of 97.82% on OCT-C8 and 99.58% on OCT2017β€”surpassing prior methods on both benchmarks. The framework demonstrates significantly improved robustness to noise and superior fine-grained discriminative capability for subtle pathological patterns.

Technology Category

Application Category

πŸ“ Abstract
Retinal diseases are a leading cause of vision impairment and blindness, with timely diagnosis being critical for effective treatment. Optical Coherence Tomography (OCT) has become a standard imaging modality for retinal disease diagnosis, but OCT images often suffer from issues such as speckle noise, complex lesion shapes, and varying lesion sizes, making interpretation challenging. In this paper, we propose a novel framework, WaveNet-SF, to enhance retinal disease detection by integrating spatial-domain and frequency-domain learning. The framework utilizes wavelet transforms to decompose OCT images into low- and high-frequency components, enabling the model to extract both global structural features and fine-grained details. To improve lesion detection, we introduce a multi-scale wavelet spatial attention (MSW-SA) module, which enhances the model's focus on regions of interest at multiple scales. Additionally, a high-frequency feature compensation block (HFFC) is incorporated to recover edge information lost during wavelet decomposition, suppress noise, and preserve fine details crucial for lesion detection. Our approach achieves state-of-the-art (SOTA) classification accuracies of 97.82% and 99. 58% on the OCT-C8 and OCT2017 datasets, respectively, surpassing existing methods. These results demonstrate the efficacy of WaveNet-SF in addressing the challenges of OCT image analysis and its potential as a powerful tool for retinal disease diagnosis.
Problem

Research questions and friction points this paper is trying to address.

Retinal Disease Diagnosis
Optical Coherence Tomography (OCT) Imaging
Noise and Variability in Lesion Characteristics
Innovation

Methods, ideas, or system contributions that make the work stand out.

WaveNet-SF
Multi-scale Wavelet Spatial Attention
High-frequency Feature Compensation
πŸ”Ž Similar Papers
No similar papers found.
J
Jilan Cheng
School of Information Engineering, Nanchang University, Nanchang, 330031, China
G
Guoli Long
School of Information Engineering, Nanchang University, Nanchang, 330031, China
Z
Zeyu Zhang
School of Information Engineering, Nanchang University, Nanchang, 330031, China
Z
Zhenjia Qi
School of Information Engineering, Nanchang University, Nanchang, 330031, China
H
Hanyu Wang
School of Advanced Energy, Sun Yat-sen University, Shenzhen, 518107, China
Libin Lu
Libin Lu
School of Mathematics and Computer Science, Wuhan Polytechnic University, Wuhan, 430023, China
Shuihua Wang
Shuihua Wang
Xi'an Jiaotong-Liverpool University
Artificial intelligenceMachine LearningBio-Medical image processingDeep learning
Yudong Zhang
Yudong Zhang
University of Leicester, HFWLA/FIET/FEAI/FBCS/SMIEEE/SMACM/DSACM, Clarivate Highly Cited Researcher
artificial intelligencedeep learningmedical image processing
Jin Hong
Jin Hong
Seoul National University