Fine-grained List-wise Alignment for Generative Medication Recommendation

📅 2025-05-26
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing medical recommendation systems predominantly employ pointwise prediction, neglecting drug synergies and drug–drug interactions (DDIs), thereby compromising recommendation accuracy and safety in comorbid patient scenarios. To address this, we propose FLAME, a fine-grained listwise drug recommendation framework that models prescription generation as a sequential decision process of incremental drug addition and removal. FLAME introduces stepwise Groupwise Relative Policy Optimization (GRPO) coupled with potential-function-based reward shaping to achieve drug-level fine-grained alignment. It further enhances patient representation by jointly incorporating structured clinical knowledge and drug synergy information. Evaluated on multiple benchmark datasets, FLAME achieves state-of-the-art performance—significantly improving both recommendation accuracy and DDI safety—while enabling controllable trade-offs between safety and accuracy and demonstrating strong generalization across diverse clinical settings.

Technology Category

Application Category

📝 Abstract
Accurate and safe medication recommendations are critical for effective clinical decision-making, especially in multimorbidity cases. However, existing systems rely on point-wise prediction paradigms that overlook synergistic drug effects and potential adverse drug-drug interactions (DDIs). We propose FLAME, a fine-grained list-wise alignment framework for large language models (LLMs), enabling drug-by-drug generation of drug lists. FLAME formulates recommendation as a sequential decision process, where each step adds or removes a single drug. To provide fine-grained learning signals, we devise step-wise Group Relative Policy Optimization (GRPO) with potential-based reward shaping, which explicitly models DDIs and optimizes the contribution of each drug to the overall prescription. Furthermore, FLAME enhances patient modeling by integrating structured clinical knowledge and collaborative information into the representation space of LLMs. Experiments on benchmark datasets demonstrate that FLAME achieves state-of-the-art performance, delivering superior accuracy, controllable safety-accuracy trade-offs, and strong generalization across diverse clinical scenarios. Our code is available at https://github.com/cxfann/Flame.
Problem

Research questions and friction points this paper is trying to address.

Overcoming point-wise prediction limitations in medication recommendation systems
Addressing synergistic drug effects and adverse drug-drug interactions (DDIs)
Enhancing patient modeling with structured clinical knowledge and LLMs
Innovation

Methods, ideas, or system contributions that make the work stand out.

Fine-grained list-wise alignment for LLMs
Step-wise Group Relative Policy Optimization
Integrates clinical knowledge into LLMs
🔎 Similar Papers
No similar papers found.
Chenxiao Fan
Chenxiao Fan
Johns Hopkins University
C
Chongming Gao
University of Science and Technology of China
W
Wentao Shi
University of Science and Technology of China
Y
Yaxin Gong
University of Science and Technology of China
Z
Zihao Zhao
University of Science and Technology of China
F
Fuli Feng
University of Science and Technology of China