Data reuse enables cost-efficient randomized trials of medical AI models

πŸ“… 2025-11-12
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Traditional randomized controlled trials (RCTs) for medical AI models incur high costs and long durations, impeding rapid iterative validation. To address this, we propose BRIDGEβ€”a data-reuse RCT design that skips redundant intervention assessments in patient subgroups where new and legacy AI models yield concordant predictions. Leveraging real-world data from breast cancer, cardiovascular disease, and sepsis cohorts, BRIDGE integrates risk-cohort overlap analysis, causal inference testing, and rigorous Type I error control to enable adaptive, modular trial execution. This framework represents the first globally reported methodology to systematically reuse observational data in medical AI RCTs. In a breast cancer screening simulation, BRIDGE reduced enrollment by 46.6%, yielding cost savings exceeding $2.8 million while preserving 80% statistical power; prediction concordance among high-risk patients reached 64.8%.

Technology Category

Application Category

πŸ“ Abstract
Randomized controlled trials (RCTs) are indispensable for establishing the clinical value of medical artificial-intelligence (AI) tools, yet their high cost and long timelines hinder timely validation as new models emerge rapidly. Here, we propose BRIDGE, a data-reuse RCT design for AI-based risk models. AI risk models support a broad range of interventions, including screening, treatment selection, and clinical alerts. BRIDGE trials recycle participant-level data from completed trials of AI models when legacy and updated models make concordant predictions, thereby reducing the enrollment requirement for subsequent trials. We provide a practical checklist for investigators to assess whether reusing data from previous trials allows for valid causal inference and preserves type I error. Using real-world datasets across breast cancer, cardiovascular disease, and sepsis, we demonstrate concordance between successive AI models, with up to 64.8% overlap in top 5% high-risk cohorts. We then simulate a series of breast cancer screening studies, where our design reduced required enrollment by 46.6%--saving over US$2.8 million--while maintaining 80% power. By transforming trials into adaptive, modular studies, our proposed design makes Level I evidence generation feasible for every model iteration, thereby accelerating cost-effective translation of AI into routine care.
Problem

Research questions and friction points this paper is trying to address.

Reduces cost and time for clinical AI validation trials
Reuses data from prior trials when predictions are concordant
Maintains statistical validity while cutting enrollment requirements
Innovation

Methods, ideas, or system contributions that make the work stand out.

Reuses data from completed AI model trials
Reduces enrollment needs for subsequent trials
Maintains statistical power while cutting costs
πŸ”Ž Similar Papers
No similar papers found.
M
Michael Nercessian
UC Berkeley / UCSF Computational Precision Health
W
Wenxin Zhang
UC Berkeley Biostatistics
A
Alexander Schubert
UC Berkeley / UCSF Computational Precision Health
D
Daphne Yang
UC Berkeley College of Computing, Data Science, and Society
M
Maggie Chung
UCSF Department of Radiology and Biomedical Imaging
A
Ahmed Alaa
UC Berkeley / UCSF Computational Precision Health
Adam Yala
Adam Yala
UC Berkeley, UCSF and Voio
Artificial IntelligenceMachine LearningOncologyHealthcare