FundusGAN: A Hierarchical Feature-Aware Generative Framework for High-Fidelity Fundus Image Generation

πŸ“… 2025-03-22
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
To address the challenge of limited fundus image data hindering pretraining of ophthalmic AI models, this paper proposes a hierarchical feature-aware generative framework. The method innovatively integrates a feature pyramid encoder with a modified StyleGAN architecture to jointly preserve anatomical structure fidelity and model pathological details. Dilated convolutions and adaptive upsampling are incorporated to enhance multi-scale feature representation. Extensive validation is conducted on multi-center datasetsβ€”DDR, DRIVE, and IDRiD. On DDR, the generated images achieve SSIM = 0.8863 and FID = 54.2. When used for few-shot training, the synthetic data boost ResNet50’s retinal disease diagnosis accuracy by 6.49%. This work establishes a scalable, generative solution for efficient development of ophthalmic AI models under low-data regimes.

Technology Category

Application Category

πŸ“ Abstract
Recent advancements in ophthalmology foundation models such as RetFound have demonstrated remarkable diagnostic capabilities but require massive datasets for effective pre-training, creating significant barriers for development and deployment. To address this critical challenge, we propose FundusGAN, a novel hierarchical feature-aware generative framework specifically designed for high-fidelity fundus image synthesis. Our approach leverages a Feature Pyramid Network within its encoder to comprehensively extract multi-scale information, capturing both large anatomical structures and subtle pathological features. The framework incorporates a modified StyleGAN-based generator with dilated convolutions and strategic upsampling adjustments to preserve critical retinal structures while enhancing pathological detail representation. Comprehensive evaluations on the DDR, DRIVE, and IDRiD datasets demonstrate that FundusGAN consistently outperforms state-of-the-art methods across multiple metrics (SSIM: 0.8863, FID: 54.2, KID: 0.0436 on DDR). Furthermore, disease classification experiments reveal that augmenting training data with FundusGAN-generated images significantly improves diagnostic accuracy across multiple CNN architectures (up to 6.49% improvement with ResNet50). These results establish FundusGAN as a valuable foundation model component that effectively addresses data scarcity challenges in ophthalmological AI research, enabling more robust and generalizable diagnostic systems while reducing dependency on large-scale clinical data collection.
Problem

Research questions and friction points this paper is trying to address.

Generates high-fidelity fundus images to overcome data scarcity
Improves diagnostic accuracy with synthetic training data augmentation
Preserves retinal structures while enhancing pathological detail representation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hierarchical feature-aware generative framework
Feature Pyramid Network for multi-scale extraction
Modified StyleGAN with dilated convolutions
πŸ”Ž Similar Papers
No similar papers found.
Qingshan Hou
Qingshan Hou
Northeastern University; National University of Singapore
medical image analysisfoundation modeldeep learning
M
Meng Wang
Ophthalmology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore
P
Peng Cao
Computer Science and Engineering, Northeastern University, Shenyang, China; Key Laboratory of Intelligent Computing in Medical Image of Ministry of Education, Northeastern University, Shenyang, China
Z
Zou Ke
Ophthalmology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore
X
Xiaoli Liu
Computer Science and Engineering, Northeastern University, Shenyang, China; Key Laboratory of Intelligent Computing in Medical Image of Ministry of Education, Northeastern University, Shenyang, China
Huazhu Fu
Huazhu Fu
Principal Scientist, IHPC, A*STAR
Medical Image AnalysisAI for HealthcareMedical AITrustworthy AI
O
Osmar R. Zaiane
Alberta Machine Intelligence Institute, University of Alberta, Edmonton, Canada