A foundation model for human-AI collaboration in medical literature mining

πŸ“… 2025-01-27
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Current AI systems exhibit poor generalizability and low accuracy in cross-disease medical literature retrieval, screening, and data extraction. To address this, we propose LEADSβ€”the first foundation model explicitly designed for human-AI collaborative medical literature mining, deeply aligned with systematic review (SR) expert workflows. LEADS is fine-tuned on 630K high-quality medical instructions derived from 21K SRs, 450K clinical trial publications, and 27K trial registry entries, enabling traceable reference generation and interactive task execution. On six core biomedical NLP tasks, LEADS consistently outperforms four leading large language models. Under expert collaboration, it achieves a study screening recall of 0.81 (+4% absolute improvement) and data extraction accuracy of 0.85 (+5%), while reducing average processing time by over 25%. These advances significantly enhance the efficiency and reliability of evidence-based medicine practice.

Technology Category

Application Category

πŸ“ Abstract
Systematic literature review is essential for evidence-based medicine, requiring comprehensive analysis of clinical trial publications. However, the application of artificial intelligence (AI) models for medical literature mining has been limited by insufficient training and evaluation across broad therapeutic areas and diverse tasks. Here, we present LEADS, an AI foundation model for study search, screening, and data extraction from medical literature. The model is trained on 633,759 instruction data points in LEADSInstruct, curated from 21,335 systematic reviews, 453,625 clinical trial publications, and 27,015 clinical trial registries. We showed that LEADS demonstrates consistent improvements over four cutting-edge generic large language models (LLMs) on six tasks. Furthermore, LEADS enhances expert workflows by providing supportive references following expert requests, streamlining processes while maintaining high-quality results. A study with 16 clinicians and medical researchers from 14 different institutions revealed that experts collaborating with LEADS achieved a recall of 0.81 compared to 0.77 experts working alone in study selection, with a time savings of 22.6%. In data extraction tasks, experts using LEADS achieved an accuracy of 0.85 versus 0.80 without using LEADS, alongside a 26.9% time savings. These findings highlight the potential of specialized medical literature foundation models to outperform generic models, delivering significant quality and efficiency benefits when integrated into expert workflows for medical literature mining.
Problem

Research questions and friction points this paper is trying to address.

Medical AI
Literature Review
Efficiency and Accuracy
Innovation

Methods, ideas, or system contributions that make the work stand out.

LEADS AI System
Medical Literature Analysis
Enhanced Efficiency and Accuracy
πŸ”Ž Similar Papers
No similar papers found.
Z
Zifeng Wang
School of Computing and Data Science, University of Illinois Urbana-Champaign, Urbana, IL, USA
Lang Cao
Lang Cao
CS PhD Student at University of Illinois Urbana-Champaign
Machine LearningMachine ReasoningAI for Health
Q
Qiao Jin
Division of Intramural Research, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
J
Joey Chan
Division of Intramural Research, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
N
Nicholas Wan
Division of Intramural Research, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
B
Behdad Afzali
Kidney Diseases Branch, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD, USA
H
Hyun-Jin Cho
Center for Advanced Medical Computing and Analysis, Department of Radiology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
Chang-In Choi
Chang-In Choi
Center for Advanced Medical Computing and Analysis, Department of Radiology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
M
Mehdi Emamverdi
National Eye Institute, National Institutes of Health, Bethesda, MD, USA
M
Manjot K. Gill
Department of Ophthalmology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
S
Sun-Hyung Kim
Center for Advanced Medical Computing and Analysis, Department of Radiology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA; Division of Pulmonary and Critical Care Medicine, Department of Medicine, Chungbuk National University Hospital, Chungbuk National University College of Medicine, Cheongju, Republic of Korea
Y
Yijia Li
Department of Medicine, University of Pittsburgh Medical Center, Pittsburgh, PA, USA
Y
Yi Liu
Department of Medicine, Weill Cornell Medicine, New York, NY, USA
H
Hanley Ong
Department of Radiology, Weill Cornell Medicine, New York, NY, USA
J
Justin Rousseau
Department of Neurology, UT Southwestern Medical Center, Dallas, TX, USA
I
Irfan Sheikh
Department of Neurology, UT Southwestern Medical Center, Dallas, TX, USA
J
Jenny J. Wei
Department of Dermatology, University of Washington, Seattle, WA, USA
Ziyang Xu
Ziyang Xu
The Chinese University of Hong Kong
AI for ScienceBioinformaticsMedical Image Processing
C
Christopher M. Zallek
OSF HealthCare Illinois Neurological Institute, Peoria, IL, USA
Kyungsang Kim
Kyungsang Kim
Assistant Professor at Harvard Medical School and Mass General Hospital
Deep learningLogical AICompressed sensingMedical imagingOptimization
Y
Yifan Peng
Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, USA
Zhiyong Lu
Zhiyong Lu
Senior Investigator, NLM; Adjunct Professor of CS, UIUC
BioNLPBiomedical InformaticsMedical AIArtificial Intelligence
Jimeng Sun
Jimeng Sun
Professor at University of Illinois Urbana-Champaign
AI for healthcareMachine learning for healthcaredeep learning for healthcare