Large Language Models versus Classical Machine Learning: Performance in COVID-19 Mortality Prediction Using High-Dimensional Tabular Data

📅 2024-09-02
🏛️ arXiv.org
📈 Citations: 1
Influential: 0
📄 PDF
🤖 AI Summary
This study systematically compares large language models (LLMs) with classical machine learning (CML) methods for mortality prediction from high-dimensional tabular clinical data of COVID-19 patients—a task traditionally dominated by CMLs. Method: Structured electronic health record features are converted into natural language prompts; GPT-4 is applied in zero-shot classification, while Mistral-7B is fine-tuned via QLoRA. Performance is benchmarked against XGBoost and Random Forest. Contribution/Results: CMLs achieve strong internal/external validation F1-scores of 0.87/0.83. GPT-4 zero-shot performs poorly (F1 = 0.43), whereas QLoRA-finetuned Mistral-7B attains F1 = 0.74 and recall = 79%, with stable external generalization. Crucially, this work demonstrates that lightweight LLMs—when efficiently adapted via parameter-efficient fine-tuning—can approach the predictive performance of established CMLs on structured healthcare prediction tasks. It establishes a novel paradigm for leveraging LLMs in tabular biomedical data analysis, bridging the gap between natural language processing and clinical informatics.

Technology Category

Application Category

📝 Abstract
Background: This study aimed to evaluate and compare the performance of classical machine learning models (CMLs) and large language models (LLMs) in predicting mortality associated with COVID-19 by utilizing a high-dimensional tabular dataset. Materials and Methods: We analyzed data from 9,134 COVID-19 patients collected across four hospitals. Seven CML models, including XGBoost and random forest (RF), were trained and evaluated. The structured data was converted into text for zero-shot classification by eight LLMs, including GPT-4 and Mistral-7b. Additionally, Mistral-7b was fine-tuned using the QLoRA approach to enhance its predictive capabilities. Results: Among the CML models, XGBoost and RF achieved the highest accuracy, with F1 scores of 0.87 for internal validation and 0.83 for external validation. In the LLM category, GPT-4 was the top performer with an F1 score of 0.43. Fine-tuning Mistral-7b significantly improved its recall from 1% to 79%, resulting in an F1 score of 0.74, which was stable during external validation. Conclusion: While LLMs show moderate performance in zero-shot classification, fine-tuning can significantly enhance their effectiveness, potentially aligning them closer to CML models. However, CMLs still outperform LLMs in high-dimensional tabular data tasks.
Problem

Research questions and friction points this paper is trying to address.

Comparing classical ML and LLMs for COVID-19 mortality prediction
Evaluating performance using high-dimensional tabular patient data
Assessing fine-tuning impact on LLMs for medical classification
Innovation

Methods, ideas, or system contributions that make the work stand out.

Used XGBoost and Random Forest for high-dimensional data
Applied zero-shot classification with GPT-4 and Mistral-7b
Fine-tuned Mistral-7b using QLoRA approach for improvement
🔎 Similar Papers
No similar papers found.
M
Mohammadreza Ghaffarzadeh-Esfahani
Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran
M
Mahdi Ghaffarzadeh-Esfahani
Faculty of Medicine, Isfahan University of Medical Sciences, Isfahan, Iran
A
Arian Salahi-Niri
Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran
H
Hossein Toreyhi
Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran
Zahra Atf
Zahra Atf
Interdisciplinary Researcher, PhD in Business.
Digital MarketingExplainable AILLMsMoral PhilosophyCognitive Science.
A
Amirali Mohsenzadeh-Kermani
Faculty of Medicine, Isfahan University of Medical Sciences, Isfahan, Iran
M
Mahshad Sarikhani
School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
Z
Zohreh Tajabadi
Digestive Disease Research Institute, Tehran University of Medical Sciences, Tehran, Iran
Fatemeh Shojaeian
Fatemeh Shojaeian
Department of Surgery, The Johns Hopkins University, Baltimore, MD, USA
M
Mohammad Hassan Bagheri
Faculty of Medicine, Isfahan University of Medical Sciences, Isfahan, Iran
A
Aydin Feyzi
Student Research Committee, School of Nursing and Midwifery, Shahid Beheshti University of Medical Sciences, Tehran, Iran
M
Mohammadamin Tarighatpayma
School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
N
Narges Gazmeh
Student Research Committee, School of Nursing and Midwifery, Shahid Beheshti University of Medical Sciences, Tehran, Iran
F
Fateme Heydari
Student Research Committee, School of Nursing and Midwifery, Shahid Beheshti University of Medical Sciences, Tehran, Iran
H
Hossein Afshar
Student Research Committee, School of Nursing and Midwifery, Shahid Beheshti University of Medical Sciences, Tehran, Iran
A
Amirreza Allahgholipour
Student Research Committee, School of Nursing and Midwifery, Shahid Beheshti University of Medical Sciences, Tehran, Iran
F
Farid Alimardani
Student Research Committee, School of Nursing and Midwifery, Shahid Beheshti University of Medical Sciences, Tehran, Iran
A
Ameneh Salehi
School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
N
Naghmeh Asadimanesh
School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
Mohammad Amin Khalafi
Mohammad Amin Khalafi
Research Fellow at Research Institute for Gastroenterology and Liver Diseases
AILLMInternal Medicine
H
Hadis Shabanipour
Student Research Committee, School of Nursing and Midwifery, Shahid Beheshti University of Medical Sciences, Tehran, Iran
Ali Moradi
Ali Moradi
Student Research Committee, School of Nursing and Midwifery, Shahid Beheshti University of Medical Sciences, Tehran, Iran
S
Sajjad Hossein Zadeh
Student Research Committee, School of Nursing and Midwifery, Shahid Beheshti University of Medical Sciences, Tehran, Iran
O
Omid Yazdani
School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
R
Romina Esbati
School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
M
Moozhan Maleki
Student Research Committee, School of Nursing and Midwifery, Shahid Beheshti University of Medical Sciences, Tehran, Iran
D
Danial Samiei Nasr
School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
A
Amirali Soheili
School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
H
Hossein Majlesi
School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
S
Saba Shahsavan
School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
A
Alireza Soheilipour
School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
N
Nooshin Goudarzi
Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran
E
Erfan Taherifard
MPH department, Shiraz University of Medical Sciences, Shiraz, Iran
H
Hamidreza Hatamabadi
Department of Emergency Medicine, School of Medicine, Safety Promotion and Injury Prevention Research Center, Imam Hossein Hospital, Shahid Beheshti University of Medical Sciences, Tehran, Iran
J
Jamil S. Samaan
Karsh Division of Gastroenterology and Hepatology, Cedars-Sinai Medical Center, 8700 Beverly B
Thomas Savage
Thomas Savage
University of Pennsylvania
Artificial Intelligence in Medicine
A
Ankit Sakhuja
Student Research Committee, School of Nursing and Midwifery, Shahid Beheshti University of Medical Sciences, Tehran, Iran
Ali Soroush
Ali Soroush
Icahn School of Medicine at Mount Sinai
Gastroenterology Artificial Intelligence Machine Learning
Girish Nadkarni
Girish Nadkarni
Icahn School of Medicine at Mount Sinai
HypertensionGeneticsKidney DiseaseAIMachine Learning
Ilad Alavi Darazam
Ilad Alavi Darazam
Student Research Committee, School of Nursing and Midwifery, Shahid Beheshti University of Medical Sciences, Tehran, Iran
Mohamad Amin Pourhoseingholi
Mohamad Amin Pourhoseingholi
Student Research Committee, School of Nursing and Midwifery, Shahid Beheshti University of Medical Sciences, Tehran, Iran
Seyed Amir Ahmad Safavi-Naini
Seyed Amir Ahmad Safavi-Naini
Research Fellow at Research Institute for Gastroenterology and Liver Diseases
Gastrointestinal CancerPancreatic CancerCancer PreventionPrecision Medicine