Empowering Clinical Trial Design through AI: A Randomized Evaluation of PowerGPT

📅 2025-09-15
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Clinical trial sample size calculation requires specialized statistical expertise, posing a significant practical barrier for non-statistician researchers. To address this, we propose PowerGPT—the first AI system integrating large language models (LLMs) with a domain-specific statistical engine—enabling automated identification of appropriate statistical tests, parsing of hypothesis specifications, and accurate sample size estimation for diverse procedures (e.g., t-tests, ANOVA, survival analysis) directly from natural language input. Its key innovation lies in constructing an interpretable statistical decision chain that bridges the semantic gap between clinical domain knowledge and formal statistical methodology. In a randomized controlled evaluation, PowerGPT achieved a 99.3% task completion rate and 94.1% accuracy, reducing average computation time from 9.3 to 4.0 minutes. These results demonstrate substantial improvements in accessibility, efficiency, and reliability of clinical trial design.

Technology Category

Application Category

📝 Abstract
Sample size calculations for power analysis are critical for clinical research and trial design, yet their complexity and reliance on statistical expertise create barriers for many researchers. We introduce PowerGPT, an AI-powered system integrating large language models (LLMs) with statistical engines to automate test selection and sample size estimation in trial design. In a randomized trial to evaluate its effectiveness, PowerGPT significantly improved task completion rates (99.3% vs. 88.9% for test selection, 99.3% vs. 77.8% for sample size calculation) and accuracy (94.1% vs. 55.4% in sample size estimation, p < 0.001), while reducing average completion time (4.0 vs. 9.3 minutes, p < 0.001). These gains were consistent across various statistical tests and benefited both statisticians and non-statisticians as well as bridging expertise gaps. Already under deployment across multiple institutions, PowerGPT represents a scalable AI-driven approach that enhances accessibility, efficiency, and accuracy in statistical power analysis for clinical research.
Problem

Research questions and friction points this paper is trying to address.

Automating complex statistical power analysis for clinical trials
Reducing reliance on statistical expertise in trial design
Improving accuracy and efficiency in sample size calculations
Innovation

Methods, ideas, or system contributions that make the work stand out.

AI-powered system automating test selection
Integrates large language models with statistical engines
Enhances accessibility, efficiency, and accuracy
🔎 Similar Papers
No similar papers found.
Y
Yiwen Lu
Center for Health AI and Synthesis of Evidence (CHASE), Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
L
Lu Li
Center for Health AI and Synthesis of Evidence (CHASE), Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Dazheng Zhang
Dazheng Zhang
PhD in Biostatistics, University of Pennsylvania
Real-World Evidence (RWE)Clinical TrialElectronic Health RecordsBiostatistics
X
Xinyao Jian
Center for Health AI and Synthesis of Evidence (CHASE), Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
T
Tingyin Wang
Center for Health AI and Synthesis of Evidence (CHASE), Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
S
Siqi Chen
Center for Health AI and Synthesis of Evidence (CHASE), Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Y
Yuqing Lei
Center for Health AI and Synthesis of Evidence (CHASE), Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Jiayi Tong
Jiayi Tong
Assistant Professor, Department of Biostatistics, Johns Hopkins University
BiostatisticsBiomedical informaticsReal-world evidence (RWE)Meta-analysis
Zhaohan Xi
Zhaohan Xi
Binghamton University
AI for ScienceLarge Language ModelsHealthcare AICybersecurityAI Security
Haitao Chu
Haitao Chu
Professor of Biostatistics, University of Minnesota
Bayesian InferencePrecision MedicineBiostatisticsEpidemiology MethodsMeta-analysis
Chongliang Luo
Chongliang Luo
Washington University in St Louis
data integrationdistributed statistical learningmeta-analysisvaccine and drug safety
A
Alexis Ogdie
Department of Medicine and Rheumatology, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
Brian Athey
Brian Athey
Professor of Computational Medicine and Bioinformatics'
Bioinformaticsmicroscopychromatin
A
Alparslan Turan
Department of Anesthesiology, Critical Care and Pain Medicine, McGovern Medical School, Houston, TX, USA
M
Michael Abramoff
Department of Ophthalmology and Visual Sciences, University of Iowa Hospital and Clinics, Iowa City, IA, USA
J
Joseph C Cappelleri
Statistical Research and Data Science Center, Pfizer Inc, Groton, CT, USA
H
Hua Xu
Department of Biomedical Informatics and Data Science, School of Medicine, Yale University, New Haven, CT, USA
Y
Yun Lu
Center for Biologics Evaluation and Research, Food and Drug Administration, Silver Spring, MD, USA
J
Jesse Berlin
Epidemiology, Rutgers the State University of New Jersey, New Brunswick, NJ, USA
D
Daniel I. Sessler
Department of Anesthesiology, Critical Care and Pain Medicine, McGovern Medical School, Houston, TX, USA
D
David A. Asch
Department of Medical Ethics and Health Policy, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Xiaoqian Jiang
Xiaoqian Jiang
McWilliams School of Biomedical Informatics, UTHealth
predictive modelinghealthcare privacy
Y
Yong Chen
Center for Health AI and Synthesis of Evidence (CHASE), Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA