A multimodal AI agent for clinical decision support in ophthalmology

📅 2025-11-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Current ophthalmic AI systems suffer from limited flexibility, poor interpretability, and inadequate multimodal integration capabilities. To address these challenges, we propose EyeAgent—the first clinical decision-support intelligent agent framework for ophthalmology, built upon the DeepSeek-V3 large language model. EyeAgent dynamically orchestrates 53 validated, ophthalmology-specific tools across 23 imaging modalities, enabling coordinated analysis for classification, segmentation, detection, and structured report generation. Its modular, interpretable architecture ensures tight alignment with clinical workflows, supporting stepwise reasoning and human-AI collaboration. Evaluated on 200 real-world clinical cases, EyeAgent achieves a 93.7% tool-selection accuracy and receives expert ratings exceeding 88% across usability and reliability metrics. When deployed as a diagnostic aid, it improves overall diagnostic accuracy by 18.51% and enhances report quality by 19%, demonstrating significant clinical utility and robust multimodal reasoning capability.

Technology Category

Application Category

📝 Abstract
Artificial intelligence has shown promise in medical imaging, yet most existing systems lack flexibility, interpretability, and adaptability - challenges especially pronounced in ophthalmology, where diverse imaging modalities are essential. We present EyeAgent, the first agentic AI framework for comprehensive and interpretable clinical decision support in ophthalmology. Using a large language model (DeepSeek-V3) as its central reasoning engine, EyeAgent interprets user queries and dynamically orchestrates 53 validated ophthalmic tools across 23 imaging modalities for diverse tasks including classification, segmentation, detection, image/report generation, and quantitative analysis. Stepwise ablation analysis demonstrated a progressive improvement in diagnostic accuracy, rising from a baseline of 69.71% (using only 5 general tools) to 80.79% when the full suite of 53 specialized tools was integrated. In an expert rating study on 200 real-world clinical cases, EyeAgent achieved 93.7% tool selection accuracy and received expert ratings of more than 88% across accuracy, completeness, safety, reasoning, and interpretability. In human-AI collaboration, EyeAgent matched or exceeded the performance of senior ophthalmologists and, when used as an assistant, improved overall diagnostic accuracy by 18.51% and report quality scores by 19%, with the greatest benefit observed among junior ophthalmologists. These findings establish EyeAgent as a scalable and trustworthy AI framework for ophthalmology and provide a blueprint for modular, multimodal, and clinically aligned next-generation AI systems.
Problem

Research questions and friction points this paper is trying to address.

Developing a flexible AI framework for ophthalmology clinical decision support
Integrating multiple imaging modalities and tools for comprehensive diagnosis
Improving diagnostic accuracy and interpretability in ophthalmology AI systems
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses large language model as reasoning engine
Dynamically orchestrates 53 specialized ophthalmic tools
Integrates multimodal imaging across 23 modalities
🔎 Similar Papers
No similar papers found.
D
Danli Shi
School of Optometry, The Hong Kong Polytechnic University, Hong Kong
X
Xiaolan Chen
School of Optometry, The Hong Kong Polytechnic University, Hong Kong
B
Bingjie Yan
School of Optometry, The Hong Kong Polytechnic University, Hong Kong
W
Weiyi Zhang
School of Optometry, The Hong Kong Polytechnic University, Hong Kong
P
Pusheng Xu
School of Optometry, The Hong Kong Polytechnic University, Hong Kong
J
Jianchen Yang
Swiss Federal Institute of Technology Lausanne (EPFL), Lausanne, Switzerland
Ruoyu Chen
Ruoyu Chen
Institute of Information Engineering, Chinese Academy of Sciences.
Explainable AITrustworthy AIFoundation Model
Siyu Huang
Siyu Huang
Assistant Professor, Clemson University
computer visionmachine learninggenerative models
Bowen Liu
Bowen Liu
Andreessen Horowitz, insitro, Stanford
Computational ChemistryDrug DiscoveryGraph Machine Learning
X
Xinyuan Wu
School of Optometry, The Hong Kong Polytechnic University, Hong Kong
M
Meng Xie
School of Optometry, The Hong Kong Polytechnic University, Hong Kong
Z
Ziyu Gao
School of Optometry, The Hong Kong Polytechnic University, Hong Kong
Y
Yue Wu
School of Optometry, The Hong Kong Polytechnic University, Hong Kong
S
Senlin Lin
Shanghai Eye Diseases Prevention &Treatment Center/ Shanghai Eye Hospital, School of Medicine, Tongji University, Shanghai, China
K
Kai Jin
Department of Ophthalmology, The Second Affiliated Hospital, School of Medicine, Zhejiang University, Hangzhou, China
X
Xia Gong
School of Optometry, The Hong Kong Polytechnic University, Hong Kong
Y
Y. Tham
Centre for Innovation and Precision Eye Health & Department of Ophthalmology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore
X
Xiujuan Zhang
Department of Ophthalmology, The University of Hong Kong, Hong Kong
L
Li Dong
Beijing Tongren Eye Center, Beijing key Laboratory of Intraocular Tumor Diagnosis and Treatment, Beijing Ophthalmology &Visual Sciences Key Laboratory, Medical Artificial Intelligence Research and Verification Key Laboratory of the Ministry of Industry and Information Technology, Beijing Key Laboratory of Intelligent Diagnosis, Treatment and Prevention of Blinding Eye Diseases, Beijing Tongren Hospital, Capital Medical University, Beijing, China
Y
Yuzhou Zhang
Department of Ophthalmology and Visual Sciences, The Chinese University of Hong Kong, Hong Kong
J
Jason C. Yam
Department of Ophthalmology and Visual Sciences, The Chinese University of Hong Kong, Hong Kong
G
Guangming Jin
State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangdong Provincial Clinical Research Center for Ocular Diseases, Guangzhou, China
X
Xiaohu Ding
State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangdong Provincial Clinical Research Center for Ocular Diseases, Guangzhou, China
H
Haidong Zou
Shanghai Eye Diseases Prevention &Treatment Center/ Shanghai Eye Hospital, School of Medicine, Tongji University, Shanghai, China
Yalin Zheng
Yalin Zheng
University of Liverpool
image processingcomputer visionmachine learning and medical image analysis
Z
Zongyuan Ge
AIM for Health Lab, Faculty of Information Technology, Monash University, Melbourne, Victoria, Australia
M
M. He
School of Optometry, The Hong Kong Polytechnic University, Hong Kong