Realistic CDSS Drug Dosing with End-to-end Recurrent Q-learning for Dual Vasopressor Control

📅 2025-10-01
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Reinforcement learning (RL) policies for dual-vasopressor titration in septic shock patients suffer from clinical impracticality and poor interpretability. Method: We propose an end-to-end RL framework featuring: (1) a discrete–continuous–directional coupled action space to enhance clinical operability; (2) integration of offline conservative Q-learning with recurrent neural networks to model temporal dependencies in the replay buffer, thereby improving policy stability and interpretability; and (3) joint training and validation on multi-center ICU time-series data from eICU and MIMIC. Results: Our method significantly improves 28-day survival probability by +15.2% on real-world data and generates treatment policies fully compliant with clinical guidelines. It establishes a novel, trustworthy paradigm for deploying clinical decision support systems (CDSS) in critical-care dual-vasopressor management.

Technology Category

Application Category

📝 Abstract
Reinforcement learning (RL) applications in Clinical Decision Support Systems (CDSS) frequently encounter skepticism from practitioners regarding inoperable dosing decisions. We address this challenge with an end-to-end approach for learning optimal drug dosing and control policies for dual vasopressor administration in intensive care unit (ICU) patients with septic shock. For realistic drug dosing, we apply action space design that accommodates discrete, continuous, and directional dosing strategies in a system that combines offline conservative Q-learning with a novel recurrent modeling in a replay buffer to capture temporal dependencies in ICU time-series data. Our comparative analysis of norepinephrine dosing strategies across different action space formulations reveals that the designed action spaces improve interpretability and facilitate clinical adoption while preserving efficacy. Empirical results1 on eICU and MIMIC demonstrate that action space design profoundly influences learned behavioral policies. The proposed methods achieve improved patient outcomes of over 15% in survival improvement probability, while aligning with established clinical protocols.
Problem

Research questions and friction points this paper is trying to address.

Developing realistic dual vasopressor dosing policies for ICU patients
Addressing clinical skepticism through interpretable action space design
Improving patient survival probability while maintaining protocol compliance
Innovation

Methods, ideas, or system contributions that make the work stand out.

End-to-end recurrent Q-learning for dual vasopressor control
Action space design with discrete continuous directional dosing
Offline conservative Q-learning with recurrent replay buffer modeling
🔎 Similar Papers
No similar papers found.
W
Will Y. Zou
Angle.ac
Jean Feng
Jean Feng
Department of Epidemiology and Biostatistics, University of California, San Francisco
machine learningstatisticsbiostatistics
A
Alexandre Kalimouttou
University of California, San Francisco
J
Jennifer Yuntong Zhang
Engineering Science, University of Toronto
C
Christopher W. Seymour
University of Pittsburgh
Romain Pirracchio
Romain Pirracchio
UCSF
MedecineAIBiostatistics