TwinWeaver: An LLM-Based Foundation Model Framework for Pan-Cancer Digital Twins

📅 2026-01-28

📈 Citations: 0

✨ Influential: 0

career value

188K/year

🤖 AI Summary

This study addresses the challenge of effectively modeling sparse, multimodal clinical time-series data for predicting patient events and disease trajectories in precision oncology. To this end, the authors propose the Pan-Cancer Digital Twin (GDT) framework, which, for the first time, leverages large language models (LLMs) to model longitudinal patient histories as textual sequences, enabling unified clinical event prediction and trajectory inference. The approach supports zero-shot generalization to clinical trial data and provides interpretable clinical reasoning. Evaluated on a cohort of 93,054 patients, GDT significantly outperforms baseline methods, achieving a mean absolute scaled error (MASE) of 0.87 (versus 0.97) and a concordance index (C-index) of 0.703 (versus 0.662), with consistent superior performance demonstrated in external validation trials.

Technology Category

Application Category

📝 Abstract

Precision oncology requires forecasting clinical events and trajectories, yet modeling sparse, multi-modal clinical time series remains a critical challenge. We introduce TwinWeaver, an open-source framework that serializes longitudinal patient histories into text, enabling unified event prediction as well as forecasting with large language models, and use it to build Genie Digital Twin (GDT) on 93,054 patients across 20 cancer types. In benchmarks, GDT significantly reduces forecasting error, achieving a median Mean Absolute Scaled Error (MASE) of 0.87 compared to 0.97 for the strongest time-series baseline (p<0.001). Furthermore, GDT improves risk stratification, achieving an average concordance index (C-index) of 0.703 across survival, progression, and therapy switching tasks, surpassing the best baseline of 0.662. GDT also generalizes to out-of-distribution clinical trials, matching trained baselines at zero-shot and surpassing them with fine-tuning, achieving a median MASE of 0.75-0.88 and outperforming the strongest baseline in event prediction with an average C-index of 0.672 versus 0.648. Finally, TwinWeaver enables an interpretable clinical reasoning extension, providing a scalable and transparent foundation for longitudinal clinical modeling.

Problem

Research questions and friction points this paper is trying to address.

precision oncology

clinical time series

pan-cancer

digital twins

event forecasting

Innovation

Methods, ideas, or system contributions that make the work stand out.

Large Language Models

Digital Twins

Pan-Cancer Forecasting