FusionFM: Fusing Eye-specific Foundational Models for Optimized Ophthalmic Diagnosis

📅 2025-08-14
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Significant performance disparities, inconsistent cross-task generalization, and unclear ensemble potential exist among state-of-the-art ophthalmic foundation models. Method: We systematically evaluate and fuse four advanced models—RETFound, VisionFM, RetiZero, and DINORET—using two gated fusion strategies, benchmarking uniformly across multinational standardized datasets on glaucoma, diabetic retinopathy, age-related macular degeneration (AMD), diabetes, and hypertension prediction using AUC and F1 scores. Results: RetiZero achieves the strongest cross-dataset generalization, while DINORET and RetiZero jointly attain top overall performance. Ensemble models yield modest yet robust improvements in glaucoma, AMD, and hypertension prediction (+0.5–1.2% AUC), though systemic disease prediction—particularly externally sourced hypertension—remains challenging. This work establishes the first comprehensive evaluation and fusion framework for ophthalmic foundation models spanning both ocular and systemic diseases, providing empirical evidence and a methodological paradigm for clinically deployable multi-task diagnostic systems.

Technology Category

Application Category

📝 Abstract
Foundation models (FMs) have shown great promise in medical image analysis by improving generalization across diverse downstream tasks. In ophthalmology, several FMs have recently emerged, but there is still no clear answer to fundamental questions: Which FM performs the best? Are they equally good across different tasks? What if we combine all FMs together? To our knowledge, this is the first study to systematically evaluate both single and fused ophthalmic FMs. To address these questions, we propose FusionFM, a comprehensive evaluation suite, along with two fusion approaches to integrate different ophthalmic FMs. Our framework covers both ophthalmic disease detection (glaucoma, diabetic retinopathy, and age-related macular degeneration) and systemic disease prediction (diabetes and hypertension) based on retinal imaging. We benchmarked four state-of-the-art FMs (RETFound, VisionFM, RetiZero, and DINORET) using standardized datasets from multiple countries and evaluated their performance using AUC and F1 metrics. Our results show that DINORET and RetiZero achieve superior performance in both ophthalmic and systemic disease tasks, with RetiZero exhibiting stronger generalization on external datasets. Regarding fusion strategies, the Gating-based approach provides modest improvements in predicting glaucoma, AMD, and hypertension. Despite these advances, predicting systemic diseases, especially hypertension in external cohort remains challenging. These findings provide an evidence-based evaluation of ophthalmic FMs, highlight the benefits of model fusion, and point to strategies for enhancing their clinical applicability.
Problem

Research questions and friction points this paper is trying to address.

Evaluating single and fused foundational models for ophthalmic diagnosis
Comparing performance across eye diseases and systemic conditions prediction
Developing fusion strategies to enhance clinical diagnostic accuracy
Innovation

Methods, ideas, or system contributions that make the work stand out.

Fusing multiple eye-specific foundational models
Using Gating-based approach for model fusion
Evaluating both ophthalmic and systemic disease prediction
🔎 Similar Papers
No similar papers found.
Ke Zou
Ke Zou
Apple, Inc
Power electronicsSwitched-capacitor ConverterPower Semiconductor Devices
J
Jocelyn Hui Lin Goh
Department of Ophthalmology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore; Centre for Innovation and Precision Eye Health, Yong Loo Lin School of Medicine, National University of Singapore, Singapore; Singapore Eye Research Institute, Singapore National Eye Centre, Singapore, Singapore
Y
Yukun Zhou
Centre for Medical Image Computing, University College London, London, United Kingdom
Tian Lin
Tian Lin
Google DeepMind
Machine LearningDeep LearningOnline LearningCombinatorial OptimizationSocial Networks
S
Samantha Min Er Yew
Department of Ophthalmology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore; Centre for Innovation and Precision Eye Health, Yong Loo Lin School of Medicine, National University of Singapore, Singapore
S
Sahana Srinivasan
Department of Ophthalmology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore; Centre for Innovation and Precision Eye Health, Yong Loo Lin School of Medicine, National University of Singapore, Singapore
M
Meng Wang
Department of Ophthalmology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore; Centre for Innovation and Precision Eye Health, Yong Loo Lin School of Medicine, National University of Singapore, Singapore
Rui Santos
Rui Santos
Department of Ophthalmology, Stadtspital Zürich, Zurich, Switzerland; Spross Research Institute, Zurich, Switzerland
G
Gabor M. Somfai
Department of Ophthalmology, Stadtspital Zürich, Zurich, Switzerland; Spross Research Institute, Zurich, Switzerland
Huazhu Fu
Huazhu Fu
Principal Scientist, IHPC, A*STAR
Medical Image AnalysisAI for HealthcareMedical AITrustworthy AI
H
Haoyu Chen
Joint Shantou International Eye Center, Shantou University and the Chinese University of Hong Kong, Shantou, China
P
Pearse A. Keane
NIHR Biomedical Research Centre, Moorfields Eye Hospital NHS Foundation Trust, London, United Kingdom; Institute of Ophthalmology, University College London, London, United Kingdom
C
Ching-Yu Cheng
Department of Ophthalmology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore; Centre for Innovation and Precision Eye Health, Yong Loo Lin School of Medicine, National University of Singapore, Singapore; Singapore Eye Research Institute, Singapore National Eye Centre, Singapore, Singapore; Ophthalmology and Visual Science Academic Clinical Program, Duke-NUS Medical School, Singapore, Singapore
Yih Chung Tham
Yih Chung Tham
Yong Loo Lin School of Medicine, National University of Singapore; Singapore Eye Research Institute
OphthalmologyEpidemiologyVisual ImpairmentDeep Learning