Are LLMs Truly Multilingual? Exploring Zero-Shot Multilingual Capability of LLMs for Information Retrieval: An Italian Healthcare Use Case

📅 2025-12-04
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Prior work lacks localized, clinical empirical evaluation of open-source multilingual large language models (LLMs) for comorbidity extraction from Italian electronic health records (EHRs) in zero-shot settings. Method: We systematically assess the real-time zero-shot comorbidity extraction capability of multilingual LLMs on authentic Italian clinical texts, deploying models locally and benchmarking them against rule-based matching and human annotation. Contribution/Results: Our quantitative analysis reveals that state-of-the-art multilingual LLMs underperform rule-based methods significantly and exhibit poor generalization across disease categories—highlighting fundamental limitations in domain-specific clinical text understanding. This study provides critical empirical evidence and methodological guidance on the applicability of multilingual LLMs to low-resource clinical natural language processing tasks, particularly in under-resourced languages and specialized medical domains.

Technology Category

Application Category

📝 Abstract
Large Language Models (LLMs) have become a key topic in AI and NLP, transforming sectors like healthcare, finance, education, and marketing by improving customer service, automating tasks, providing insights, improving diagnostics, and personalizing learning experiences. Information extraction from clinical records is a crucial task in digital healthcare. Although traditional NLP techniques have been used for this in the past, they often fall short due to the complexity, variability of clinical language, and high inner semantics in the free clinical text. Recently, Large Language Models (LLMs) have become a powerful tool for better understanding and generating human-like text, making them highly effective in this area. In this paper, we explore the ability of open-source multilingual LLMs to understand EHRs (Electronic Health Records) in Italian and help extract information from them in real-time. Our detailed experimental campaign on comorbidity extraction from EHR reveals that some LLMs struggle in zero-shot, on-premises settings, and others show significant variation in performance, struggling to generalize across various diseases when compared to native pattern matching and manual annotations.
Problem

Research questions and friction points this paper is trying to address.

Assess multilingual LLMs' zero-shot ability for Italian EHR information retrieval
Explore LLM performance in extracting comorbidities from clinical records
Compare LLMs against traditional methods in healthcare data extraction
Innovation

Methods, ideas, or system contributions that make the work stand out.

Zero-shot multilingual LLMs for Italian EHR extraction
Comparative evaluation of open-source models in healthcare
Real-time information retrieval from clinical records
🔎 Similar Papers
No similar papers found.
V
Vignesh Kumar Kembu
Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Italy
P
Pierandrea Morandini
IRCCS Humanitas Research Hospital, Milan, Italy
M
Marta Bianca Maria Ranzini
IRCCS Humanitas Research Hospital, Milan, Italy
Antonino Nocera
Antonino Nocera
Associate Professor, University of Pavia
Artificial IntelligenceSecurityPrivacyData Science