TwinWeaver: An LLM-Based Foundation Model Framework for Pan-Cancer Digital Twins

📅 2026-01-28
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the challenge of effectively modeling sparse, multimodal clinical time-series data for predicting patient events and disease trajectories in precision oncology. To this end, the authors propose the Pan-Cancer Digital Twin (GDT) framework, which, for the first time, leverages large language models (LLMs) to model longitudinal patient histories as textual sequences, enabling unified clinical event prediction and trajectory inference. The approach supports zero-shot generalization to clinical trial data and provides interpretable clinical reasoning. Evaluated on a cohort of 93,054 patients, GDT significantly outperforms baseline methods, achieving a mean absolute scaled error (MASE) of 0.87 (versus 0.97) and a concordance index (C-index) of 0.703 (versus 0.662), with consistent superior performance demonstrated in external validation trials.

Technology Category

Application Category

📝 Abstract
Precision oncology requires forecasting clinical events and trajectories, yet modeling sparse, multi-modal clinical time series remains a critical challenge. We introduce TwinWeaver, an open-source framework that serializes longitudinal patient histories into text, enabling unified event prediction as well as forecasting with large language models, and use it to build Genie Digital Twin (GDT) on 93,054 patients across 20 cancer types. In benchmarks, GDT significantly reduces forecasting error, achieving a median Mean Absolute Scaled Error (MASE) of 0.87 compared to 0.97 for the strongest time-series baseline (p<0.001). Furthermore, GDT improves risk stratification, achieving an average concordance index (C-index) of 0.703 across survival, progression, and therapy switching tasks, surpassing the best baseline of 0.662. GDT also generalizes to out-of-distribution clinical trials, matching trained baselines at zero-shot and surpassing them with fine-tuning, achieving a median MASE of 0.75-0.88 and outperforming the strongest baseline in event prediction with an average C-index of 0.672 versus 0.648. Finally, TwinWeaver enables an interpretable clinical reasoning extension, providing a scalable and transparent foundation for longitudinal clinical modeling.
Problem

Research questions and friction points this paper is trying to address.

precision oncology
clinical time series
pan-cancer
digital twins
event forecasting
Innovation

Methods, ideas, or system contributions that make the work stand out.

Large Language Models
Digital Twins
Pan-Cancer Forecasting
Clinical Time Series
Interpretable AI
🔎 Similar Papers
No similar papers found.
N
Nikita Makarov
Computational Sciences Center of Excellence, Roche, Penzberg, Germany; Computational Health Center, Helmholtz Munich, Munich, Germany; Department of Biology, Ludwig Maximilian University of Munich, Munich, Germany
M
Maria Bordukova
Computational Sciences Center of Excellence, Roche, Penzberg, Germany; Computational Health Center, Helmholtz Munich, Munich, Germany; Department of Biology, Ludwig Maximilian University of Munich, Munich, Germany
L
Lena Voith von Voithenberg
Early Development Oncology, Roche Innovation Center Zurich, Roche, Schlieren, Switzerland
E
Estrella Pivel-Villanueva
Computational Sciences Center of Excellence, Roche, Penzberg, Germany
S
Sabrina Mielke
Computational Sciences Center of Excellence, Genentech, New York City, USA
J
Jonathan Wickes
Computational Sciences Center of Excellence, Genentech, South San Francisco, USA
H
Hanchen Wang
Computational Sciences Center of Excellence, Genentech, South San Francisco, USA; Department of Computer Science, Stanford University, Stanford, CA, USA
Mingyu Derek Ma
Mingyu Derek Ma
Prescient Design, Genentech/Roche
Generative Language ModelsLLM AgentsNatural Language ProcessingMachine LearningAI4Science
Keunwoo Choi
Keunwoo Choi
talkpl.ai
Music Information RetrievalMachine LearningLanguage Models
Kyunghyun Cho
Kyunghyun Cho
New York University, Genentech
Machine LearningDeep Learning
Stephen Ra
Stephen Ra
Senior Director, Prescient Design, Genentech
Machine learningStatisticsNeuroscience
R
Raul Rodriguez-Esteban
Computational Sciences Center of Excellence, Roche, Basel, Switzerland
F
Fabian Schmich
Computational Sciences Center of Excellence, Roche, Penzberg, Germany
M
Michael Menden
Computational Health Center, Helmholtz Munich, Munich, Germany; Department of Biochemistry and Pharmacology, Bio21 Molecular Science and Biotechnology Institute, The University of Melbourne, Melbourne, Australia