Toward Global Large Language Models in Medicine

📅 2026-01-05
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the limited performance of existing large language models in low-resource language medical settings, which hinders equitable global deployment of AI-driven healthcare. To bridge this gap, the authors introduce GlobMed, the first multilingual medical dataset spanning 12 languages—including four low-resource ones—and its accompanying evaluation benchmark, GlobMed-Bench. Building on this foundation, they develop GlobMed-LLMs, a family of multilingual medical large language models with parameter scales ranging from 1.7B to 8B. Experimental results demonstrate that GlobMed-LLMs achieve an average performance improvement of over 40% compared to baseline models, with gains exceeding threefold on low-resource languages. These advances substantially narrow the performance disparity across languages, laying a critical foundation for fair and effective multilingual medical AI applications worldwide.

Technology Category

Application Category

📝 Abstract
Despite continuous advances in medical technology, the global distribution of health care resources remains uneven. The development of large language models (LLMs) has transformed the landscape of medicine and holds promise for improving health care quality and expanding access to medical information globally. However, existing LLMs are primarily trained on high-resource languages, limiting their applicability in global medical scenarios. To address this gap, we constructed GlobMed, a large multilingual medical dataset, containing over 500,000 entries spanning 12 languages, including four low-resource languages. Building on this, we established GlobMed-Bench, which systematically assesses 56 state-of-the-art proprietary and open-weight LLMs across multiple multilingual medical tasks, revealing significant performance disparities across languages, particularly for low-resource languages. Additionally, we introduced GlobMed-LLMs, a suite of multilingual medical LLMs trained on GlobMed, with parameters ranging from 1.7B to 8B. GlobMed-LLMs achieved an average performance improvement of over 40% relative to baseline models, with a more than threefold increase in performance on low-resource languages. Together, these resources provide an important foundation for advancing the equitable development and application of LLMs globally, enabling broader language communities to benefit from technological advances.
Problem

Research questions and friction points this paper is trying to address.

large language models
multilingual medical
low-resource languages
healthcare equity
global medicine
Innovation

Methods, ideas, or system contributions that make the work stand out.

multilingual medical LLMs
low-resource languages
GlobMed dataset
GlobMed-Bench
equitable AI in healthcare
🔎 Similar Papers
No similar papers found.
Rui Yang
Rui Yang
Duke-NUS Medical School
Medical InformaticsMedical Text MiningMedical Knowledge Graph
Huitao Li
Huitao Li
Duke-Nus Medical School
Medical Informatics
Weihao Xuan
Weihao Xuan
The University of Tokyo, RIKEN
Natural Language ProcessingComputer VisionMultimodal AIGenerative AILLM Agent
Heli Qi
Heli Qi
Waseda University, RIKEN
Multi-Modal Learning
Xin Li
Xin Li
University of Science and Technology of China
Data MiningArtificial IntelligenceNeuroscienceAI for Science
K
Kunyu Yu
Duke-NUS AI + Medical Sciences Initiative, Duke-NUS Medical School, Singapore, Singapore
Y
Yingjian Chen
Graduate School of Engineering, The University of Tokyo, Tokyo, Japan
Rongrong Wang
Rongrong Wang
Michigan State University
Applied harmonic analysis
J
Jacques Behmoaras
Duke-NUS AI + Medical Sciences Initiative, Duke-NUS Medical School, Singapore, Singapore
Tianxi Cai
Tianxi Cai
Harvard University
statisticsbiostatisticsmodelingpredictiongenomics
Bibhas Chakraborty
Bibhas Chakraborty
Associate Professor, Duke-NUS Medical School, National University of Singapore
Dynamic Treatment RegimesAdaptive Clinical TrialsMobile and Digital HealthReinforcement Learning
Qingyu Chen
Qingyu Chen
Biomedical Informatics & Data Science, Yale University; NCBI-NLM, National Institutes of Health
Text miningMachine learningData curationBioNLPMedical Imaging Analysis
L
Lionel Tim-Ee Cheng
Radiological Sciences Academic Programme, SingHealth Duke-NUS, Singapore, Singapore
M
Marie-Louise Damwanza
Centre for Biomedical Data Science, Duke-NUS Medical School, Singapore, Singapore
C
Chido Dzinotyiwei
Centre for Biomedical Data Science, Duke-NUS Medical School, Singapore, Singapore
A
Aosong Feng
Graduate School of Frontier Sciences, The University of Tokyo, Tokyo, Japan
C
Chuan Hong
Department of Biostatistics and Bioinformatics, Duke University, Durham, NC, USA
Yusuke Iwasawa
Yusuke Iwasawa
The University of Tokyo
deep learningtransfer learningfoundation modelmeta learning
Y
Yuhe Ke
Duke-NUS AI + Medical Sciences Initiative, Duke-NUS Medical School, Singapore, Singapore
L
Linah Kitala
Centre for Biomedical Data Science, Duke-NUS Medical School, Singapore, Singapore
Taehoon Ko
Taehoon Ko
The Catholic University of Korea
J
Jisan Lee
Graduate School of Engineering, The University of Tokyo, Tokyo, Japan
I
I. Li
Graduate School of Engineering, The University of Tokyo, Tokyo, Japan
J
Jonathan Chong Kai Liew
Duke-NUS AI + Medical Sciences Initiative, Duke-NUS Medical School, Singapore, Singapore
H
Hongfang Liu
McWilliams School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, USA
L
L. L. Low
Centre for Biomedical Data Science, Duke-NUS Medical School, Singapore, Singapore
Edison Marrese-Taylor
Edison Marrese-Taylor
National Institute of Advanced Industrial Science and Technology (AIST)
Natural Language Processing - Machine Learning
Yutaka Matsuo
Yutaka Matsuo
Professor, University of Tokyo
deep learningweb miningartificial intelligence
I
Isheanesu Misi
Centre for Biomedical Data Science, Duke-NUS Medical School, Singapore, Singapore
Yilin Ning
Yilin Ning
Senior Research Fellow, Centre for Quantitative Medicine, Duke-NUS Medical School
Fair and Ethical AIBiostatisticsEpidemiologyStatistical programming
J
J. Ong
Duke-NUS AI + Medical Sciences Initiative, Duke-NUS Medical School, Singapore, Singapore
M
M. Ong
Department of Emergency Medicine, Singapore General Hospital, Singapore, Singapore
Enrico Petretto
Enrico Petretto
Professor, Director and Associate Dean of Research Informatics at Duke-NUS Medical School, Singapore
Systems-GeneticsGene NetworksFibrosisDrug DiscoveryQuantum Computational Biology
H
H. Rouhizadeh
School of Information Systems, Singapore Management University, Singapore, Singapore
A
Abiram Sandralegar
School of Information Systems, Singapore Management University, Singapore, Singapore
O
Oren Schreier
School of Information Systems, Singapore Management University, Singapore, Singapore
I
Iain Beehuat Tan
Cancer and Stem Cell Biology Program, Duke-NUS Medical School, Singapore, Singapore
Patrick Tan
Patrick Tan
Duke-NUS Medical School
Cancer Research
D
D. Ting
Singapore Eye Research Institute, Singapore National Eye Centre, Singapore, Singapore
J
Junjue Wang
Graduate School of Frontier Sciences, The University of Tokyo, Tokyo, Japan
Chunhua Weng
Chunhua Weng
Professor, Columbia University
Biomedical InformaticsClinical Research Informatics
M
Matthew Yu Heng Wong
Department of Emergency Medicine, Singapore General Hospital, Singapore, Singapore
F
Fang Wu
Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
Yunze Xiao
Yunze Xiao
Language Technology Institute, Carnegie Mellon University
Natural Language ProcessingComputational Social ScienceAnthropomorphism
Xuhai Xu
Xuhai Xu
Assistant Professor, Columbia University | Google
Human-Computer InteractionUbiquitous ComputingHuman-Centered AImHealthHealth Informatics
Qingcheng Zeng
Qingcheng Zeng
PhD Student in NLP, Northwestern University
Computational Social ScienceNLPComputational Linguistics
Zhuo Zheng
Zhuo Zheng
Stanford University
Earth VisionRemote SensingComputational SustainabilityChange Detection
Yifan Peng
Yifan Peng
Associate Professor at Weill Cornell Medicine
NLPCVmachine learning
Douglas Teodoro
Douglas Teodoro
Professor, University of Geneva
biomedical NLPmachine learning for healthcaremedical informatics
N
Nan Liu
Duke-NUS AI + Medical Sciences Initiative, Duke-NUS Medical School, Singapore, Singapore