Scaling Recurrence-aware Foundation Models for Clinical Records via Next-Visit Prediction

πŸ“… 2026-03-25
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work addresses the limitations of existing electronic health record (EHR) pretraining approaches, which struggle to effectively model both the recurrence and emergence of clinical events and often suffer from inflated evaluation metrics due to repeated events. To overcome these challenges, we propose RAVENβ€”a recurrence-aware generative autoregressive pretraining framework that learns by predicting the full sequence of clinical events in the next visit. RAVEN incorporates a recurrence regularization mechanism to mitigate evaluation bias and leverages clinical event tokenization with zero-shot transfer strategies. In zero-shot prediction tasks across multiple disease incidences, RAVEN matches the performance of fully fine-tuned Transformer models and significantly outperforms conventional next-token prediction methods. Furthermore, it demonstrates strong generalization on external cohorts and reveals a synergistic scaling law between model and data size under data-constrained conditions.

Technology Category

Application Category

πŸ“ Abstract
While large-scale pretraining has revolutionized language modeling, its potential remains underexplored in healthcare with structured electronic health records (EHRs). We present RAVEN, a novel generative pretraining strategy for sequential EHR data based on Recurrence-Aware next-Visit EveNt prediction. Leveraging a dataset of over one million unique individuals, our model learns to autoregressively generate tokenized clinical events for the next visit conditioned on patient history. We introduce regularization on predicting repeated events and highlight a key pitfall in EHR-based foundation model evaluations: repeated event tokens can inflate performance metrics when new onsets are not distinguished from subsequent occurrences. Furthermore, we empirically investigate the scaling behaviors in a data-constrained, compute-saturated regime, showing that simply increasing model size is suboptimal without commensurate increases in data volume. We evaluate our model via zero-shot prediction for forecasting the incidence of a diverse set of diseases, where it rivals fully fine-tuned representation-based Transformer models and outperforms widely used simulation-based next-token approaches. Finally, without additional parameter updates, we show that RAVEN can generalize to an external patient cohort under lossy clinical code mappings and feature coverage gaps.
Problem

Research questions and friction points this paper is trying to address.

electronic health records
next-visit prediction
foundation models
recurrent events
model scaling
Innovation

Methods, ideas, or system contributions that make the work stand out.

Recurrence-Aware Modeling
Next-Visit Prediction
EHR Foundation Models
Zero-Shot Clinical Forecasting
Scaling Laws in Healthcare AI
πŸ”Ž Similar Papers
2024-07-262024 IEEE 6th International Conference on Power, Intelligent Computing and Systems (ICPICS)Citations: 10
H
Haresh Rengaraj Rajamohan
Center for Data Science, New York University, New York, NY, USA
X
Xiang Gao
Center for Data Science, New York University, New York, NY, USA
Weicheng Zhu
Weicheng Zhu
Center for Data Science, New York University
Machine learning
S
Shih-Lun Huang
Center for Data Science, New York University, New York, NY, USA
Long Chen
Long Chen
East China University of Science and Technology
Battery Electrochemistry
G
Gabe Schulman
Center for Data Science, New York University, New York, NY, USA
H
Huizhen Jin
Center for Data Science, New York University, New York, NY, USA
S
Shengduo Li
Center for Data Science, New York University, New York, NY, USA
Y
Yixuan Wang
Center for Data Science, New York University, New York, NY, USA
H
Huidi Yang
Center for Data Science, New York University, New York, NY, USA
Kyunghyun Cho
Kyunghyun Cho
New York University, Genentech
Machine LearningDeep Learning
C
Cem M. Deniz
Department of Radiology, NYU Grossman School of Medicine, New York, NY, USA; Center for Biomedical Imaging, NYU Grossman School of Medicine, New York, NY, USA
Narges Razavian
Narges Razavian
New York University Medical Center
Machine Learning for Medicine