Dementia-R1: Reinforced Pretraining and Reasoning from Unstructured Clinical Notes for Real-World Dementia Prognosis

πŸ“… 2026-01-06
πŸ›οΈ arXiv.org
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work addresses the challenge that existing large language models struggle to capture the non-monotonic symptom progression across multiple clinical visits in long-term dementia prognosis, primarily due to sparse binary reward signals and the absence of explicit symptom trajectory annotations. To overcome these limitations, the authors propose Dementia-R1, a novel framework that integrates cold-start reinforcement learning with pretraining grounded in verifiable clinical metrics. The approach first extracts longitudinal clinical signals from unstructured electronic health records for pretraining and then refines predictions of final clinical outcomes through reinforcement learning. This strategy effectively mitigates the issues of sparse rewards and complex trajectory modeling, achieving an F1 score of 77.03% on real-world data. Notably, the 7B-parameter variant matches GPT-4o’s performance on the ADNI benchmark while significantly enhancing the model’s ability to capture fluctuating cognitive trajectories.

Technology Category

Application Category

πŸ“ Abstract
While Large Language Models (LLMs) have shown strong performance on clinical text understanding, they struggle with longitudinal prediction tasks such as dementia prognosis, which require reasoning over complex, non-monotonic symptom trajectories across multiple visits. Standard supervised training lacks explicit annotations for symptom evolution, while direct Reinforcement Learning (RL) is hindered by sparse binary rewards. To address this challenge, we introduce Dementia-R1, an RL-based framework for longitudinal dementia prognosis from unstructured clinical notes. Our approach adopts a Cold-Start RL strategy that pre-trains the model to predict verifiable clinical indices extracted from patient histories, enhancing the capability to reason about disease progression before determining the final clinical status. Extensive experiments demonstrate that Dementia-R1 achieves an F1 score of 77.03% on real-world unstructured clinical datasets. Notably, on the ADNI benchmark, our 7B model rivals GPT-4o, effectively capturing fluctuating cognitive trajectories. Code is available at https://anonymous.4open.science/r/dementiar1-CDB5
Problem

Research questions and friction points this paper is trying to address.

dementia prognosis
longitudinal prediction
clinical notes
symptom trajectories
non-monotonic reasoning
Innovation

Methods, ideas, or system contributions that make the work stand out.

Reinforcement Learning
Longitudinal Prognosis
Clinical Reasoning
Cold-Start RL
Unstructured Clinical Notes
πŸ”Ž Similar Papers
C
Choonghan Kim
Graduate School of AI, KAIST, Republic of Korea
H
Hyunmin Hwang
Graduate School of AI, KAIST, Republic of Korea
H
Hangeol Chang
Graduate School of AI, KAIST, Republic of Korea
Jaemin Kim
Jaemin Kim
KAIST AI
J
Jinse Park
Haeundae Paik Hospital, Inje University, Republic of Korea
J
Jae-Sung Lim
Asan Medical Center, University of Ulsan College of Medicine, Republic of Korea
Jong Chul Ye
Jong Chul Ye
Professor, Chung Moon Soul Chair, Graduate School of AI, KAIST
machine learningcomputational imagingmedical imagingsignal processingcompressed sensing