π€ AI Summary
Traditional randomized controlled trials (RCTs) for medical AI models incur high costs and long durations, impeding rapid iterative validation. To address this, we propose BRIDGEβa data-reuse RCT design that skips redundant intervention assessments in patient subgroups where new and legacy AI models yield concordant predictions. Leveraging real-world data from breast cancer, cardiovascular disease, and sepsis cohorts, BRIDGE integrates risk-cohort overlap analysis, causal inference testing, and rigorous Type I error control to enable adaptive, modular trial execution. This framework represents the first globally reported methodology to systematically reuse observational data in medical AI RCTs. In a breast cancer screening simulation, BRIDGE reduced enrollment by 46.6%, yielding cost savings exceeding $2.8 million while preserving 80% statistical power; prediction concordance among high-risk patients reached 64.8%.
π Abstract
Randomized controlled trials (RCTs) are indispensable for establishing the clinical value of medical artificial-intelligence (AI) tools, yet their high cost and long timelines hinder timely validation as new models emerge rapidly. Here, we propose BRIDGE, a data-reuse RCT design for AI-based risk models. AI risk models support a broad range of interventions, including screening, treatment selection, and clinical alerts. BRIDGE trials recycle participant-level data from completed trials of AI models when legacy and updated models make concordant predictions, thereby reducing the enrollment requirement for subsequent trials. We provide a practical checklist for investigators to assess whether reusing data from previous trials allows for valid causal inference and preserves type I error. Using real-world datasets across breast cancer, cardiovascular disease, and sepsis, we demonstrate concordance between successive AI models, with up to 64.8% overlap in top 5% high-risk cohorts. We then simulate a series of breast cancer screening studies, where our design reduced required enrollment by 46.6%--saving over US$2.8 million--while maintaining 80% power. By transforming trials into adaptive, modular studies, our proposed design makes Level I evidence generation feasible for every model iteration, thereby accelerating cost-effective translation of AI into routine care.