Zero-shot Medical Event Prediction Using a Generative Pre-trained Transformer on Electronic Health Records

📅 2025-03-07
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study investigates the zero-shot clinical event prediction capability of GPT-style generative pre-trained models on longitudinal electronic health record (EHR) data—without fine-tuning or labeled supervision. Methodologically, patient-specific temporal sequences of medical concepts are framed as a generative modeling task; next-event prediction is performed via zero-shot prompting and probabilistic decoding, leveraging an EHR-specific temporal concept encoding scheme. Key contributions include: (1) the first systematic validation of zero-shot prediction feasibility using EHR foundation models; and (2) introduction of a “prediction-as-generation” paradigm that eliminates reliance on task-specific fine-tuning and annotated labels. Experiments demonstrate strong performance: average top-1 accuracy of 0.614 and recall of 0.524 across diverse clinical events; notably, high true positive rates and low false positive rates are achieved for 12 critical diagnostic categories, confirming the model’s capacity to capture complex patient trajectories and long-range temporal dependencies.

Technology Category

Application Category

📝 Abstract
Longitudinal data in electronic health records (EHRs) represent an individual`s clinical history through a sequence of codified concepts, including diagnoses, procedures, medications, and laboratory tests. Foundational models, such as generative pre-trained transformers (GPT), can leverage this data to predict future events. While fine-tuning of these models enhances task-specific performance, it is costly, complex, and unsustainable for every target. We show that a foundation model trained on EHRs can perform predictive tasks in a zero-shot manner, eliminating the need for fine-tuning. This study presents the first comprehensive analysis of zero-shot forecasting with GPT-based foundational models in EHRs, introducing a novel pipeline that formulates medical concept prediction as a generative modeling task. Unlike supervised approaches requiring extensive labeled data, our method enables the model to forecast a next medical event purely from a pretraining knowledge. We evaluate performance across multiple time horizons and clinical categories, demonstrating model`s ability to capture latent temporal dependencies and complex patient trajectories without task supervision. Model performance for predicting the next medical concept was evaluated using precision and recall metrics, achieving an average top1 precision of 0.614 and recall of 0.524. For 12 major diagnostic conditions, the model demonstrated strong zero-shot performance, achieving high true positive rates while maintaining low false positives. We demonstrate the power of a foundational EHR GPT model in capturing diverse phenotypes and enabling robust, zero-shot forecasting of clinical outcomes. This capability enhances the versatility of predictive healthcare models and reduces the need for task-specific training, enabling more scalable applications in clinical settings.
Problem

Research questions and friction points this paper is trying to address.

Predicts future medical events using EHR data without fine-tuning.
Introduces zero-shot forecasting with GPT models in healthcare.
Evaluates model performance across clinical categories and time horizons.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Zero-shot prediction using GPT on EHRs
Generative modeling for medical event forecasting
No fine-tuning needed for clinical predictions
🔎 Similar Papers
No similar papers found.
Ekaterina Redekop
Ekaterina Redekop
PhD student, Bioengineering at UCLA
Deep learningComputer VisionImage SegmentationMachine Learning
Z
Zichen Wang
Biomedical AI Research Lab, University of California, Los Angeles, Los Angeles, USA; Department of Radiology, University of California, Los Angeles, Los Angeles, USA; Department of Bioengineering, University of California, Los Angeles, Los Angeles, USA
R
Rushikesh Kulkarni
Biomedical AI Research Lab, University of California, Los Angeles, Los Angeles, USA; Department of Radiology, University of California, Los Angeles, Los Angeles, USA
M
Mara Pleasure
Biomedical AI Research Lab, University of California, Los Angeles, Los Angeles, USA; Department of Radiology, University of California, Los Angeles, Los Angeles, USA; Bioinformatics, University of California, Los Angeles, Los Angeles, USA
A
Aaron Chin
Department of Pediatrics, Division of Immunology, Allergy and Rheumatology, University of California, Los Angeles, Los Angeles, USA; Health Department of Information Services and Solutions, University of California, Los Angeles, University of California, Los Angeles, Los Angeles, USA
Hamid Reza Hassanzadeh
Hamid Reza Hassanzadeh
PhD, Georgia Institute of Technology
Machine LearningArtificial IntelligenceData ScienceBioinformaticsBiomedical Engineering
B
Brian L. Hill
Optum AI
Melika Emami
Melika Emami
Optum AI
Machine LearningDeep LearningML for Health
William Speier
William Speier
UCLA
Machine learningbrain-computer interfacesmedical image analysis
Corey W. Arnold
Corey W. Arnold
Professor, Depts of Radiology, Pathology, Bioengineering, and Electrical & Computer Engineering
Machine LearningDeep LearningMedical ImagingmHealthMedicine