Enhancing LLMs' Clinical Reasoning with Real-World Data from a Nationwide Sepsis Registry

📅 2025-05-05
📈 Citations: 0
Influential: 0
📄 PDF

career value

174K/year
🤖 AI Summary
Clinical large language models (LLMs) suffer from limited clinical reasoning capabilities due to insufficient training on real-world, high-fidelity clinical data. To address this, we propose C-Reason—a novel framework that constructs reasoning-intensive questions from the nationally representative, privacy-compliant, high-quality, and large-scale Sepsis Registry, and fine-tunes a lightweight Phi-4 model via Reinforcement Learning from Human Feedback (RLHF). This work marks the first demonstration of real-world sepsis data–driven clinical reasoning enhancement in LLMs, yielding statistically significant performance gains on domain-specific benchmarks (p < 0.01) and receiving strong endorsement from clinical experts. Moreover, C-Reason exhibits robust generalization across tasks (e.g., antibiotic consultation), patient populations, and disease conditions (multi-disease reasoning). Its core innovation lies in systematically integrating high-fidelity real-world clinical data into the LLM reasoning training loop—establishing a reproducible, scalable data–algorithm co-design paradigm for clinical AI.

Technology Category

Application Category

📝 Abstract
Although large language models (LLMs) have demonstrated impressive reasoning capabilities across general domains, their effectiveness in real-world clinical practice remains limited. This is likely due to their insufficient exposure to real-world clinical data during training, as such data is typically not included due to privacy concerns. To address this, we propose enhancing the clinical reasoning capabilities of LLMs by leveraging real-world clinical data. We constructed reasoning-intensive questions from a nationwide sepsis registry and fine-tuned Phi-4 on these questions using reinforcement learning, resulting in C-Reason. C-Reason exhibited strong clinical reasoning capabilities on the in-domain test set, as evidenced by both quantitative metrics and expert evaluations. Furthermore, its enhanced reasoning capabilities generalized to a sepsis dataset involving different tasks and patient cohorts, an open-ended consultations on antibiotics use task, and other diseases. Future research should focus on training LLMs with large-scale, multi-disease clinical datasets to develop more powerful, general-purpose clinical reasoning models.
Problem

Research questions and friction points this paper is trying to address.

Enhancing LLMs' clinical reasoning with real-world sepsis data
Addressing limited LLM effectiveness in clinical practice
Improving generalizability of clinical reasoning across diseases
Innovation

Methods, ideas, or system contributions that make the work stand out.

Leveraging nationwide sepsis registry data
Fine-tuning Phi-4 with reinforcement learning
Developing C-Reason for clinical reasoning
🔎 Similar Papers
No similar papers found.
J
Junu Kim
Korea Advanced Institute of Science and Technology
C
Chaeeun Shim
Korea Advanced Institute of Science and Technology
Sungjin Park
Sungjin Park
KAIST, PhD student
Machine Learning for Biomedical dataNatural Language Processing
S
Su Yeon Lee
Asan Medical Center, University of Ulsan College of Medicine
G
G. Suh
Samsung Medical Center, Sungkyunkwan University School of Medicine
C
Chae-Man Lim
Asan Medical Center, University of Ulsan College of Medicine
S
Seong Jin Choi
Seoul National University Bundang Hospital, Seoul National University College of Medicine
S
Song Mi Moon
Seoul National University Bundang Hospital, Seoul National University College of Medicine
K
Kyoung-Ho Song
Seoul National University Bundang Hospital, Seoul National University College of Medicine
E
Eu Suk Kim
Seoul National University Bundang Hospital, Seoul National University College of Medicine
H
Hong Bin Kim
Seoul National University Bundang Hospital, Seoul National University College of Medicine
Sejoong Kim
Sejoong Kim
Seoul National University Bundang Hospital, Seoul National University College of Medicine
C
Chami Im
Seoul National University Bundang Hospital, Seoul National University College of Medicine
D
Dong-Wan Kang
Seoul National University Bundang Hospital, Seoul National University College of Medicine
Yong Soo Kim
Yong Soo Kim
Seoul National University Bundang Hospital, Seoul National University College of Medicine
Hee-Joon Bae
Hee-Joon Bae
Seoul National University Bundang Hospital, Seoul National University College of Medicine
S
Sung Yoon Lim
Seoul National University Bundang Hospital, Seoul National University College of Medicine
H
Han-Gil Jeong
Seoul National University Bundang Hospital, Seoul National University College of Medicine
Edward Choi
Edward Choi
KAIST
Machine LearningArtificial IntelligenceHealthcare