Optimizing Large Language Models for Detecting Symptoms of Comorbid Depression or Anxiety in Chronic Diseases: Insights from Patient Messages

📅 2025-03-14
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Early identification of comorbid depression/anxiety in diabetic patients remains clinically challenging, necessitating reliable, low-resource NLP solutions for real-world clinical messaging. Method: We systematically evaluate multiple large language models (LLMs) on zero-shot and few-shot mental health symptom detection using authentic, de-identified clinical safety messages. We propose a lightweight adaptation framework integrating engineered prompting, systematic role-based modeling, temperature tuning, and semantic parsing of the PHQ-4 scale to enable fine-grained symptom classification. Contribution/Results: Llama 3.1 405B achieves 93% F1 score and accuracy; three leading LLMs all exceed 90% in both metrics—demonstrating high reliability for binary classification and complex psychometric scale interpretation. This is the first study to rigorously validate LLMs for mental health screening in authentic clinical message contexts. Our framework establishes a reproducible, resource-efficient adaptation paradigm for low-resource medical NLP applications.

Technology Category

Application Category

📝 Abstract
Patients with diabetes are at increased risk of comorbid depression or anxiety, complicating their management. This study evaluated the performance of large language models (LLMs) in detecting these symptoms from secure patient messages. We applied multiple approaches, including engineered prompts, systemic persona, temperature adjustments, and zero-shot and few-shot learning, to identify the best-performing model and enhance performance. Three out of five LLMs demonstrated excellent performance (over 90% of F-1 and accuracy), with Llama 3.1 405B achieving 93% in both F-1 and accuracy using a zero-shot approach. While LLMs showed promise in binary classification and handling complex metrics like Patient Health Questionnaire-4, inconsistencies in challenging cases warrant further real-life assessment. The findings highlight the potential of LLMs to assist in timely screening and referrals, providing valuable empirical knowledge for real-world triage systems that could improve mental health care for patients with chronic diseases.
Problem

Research questions and friction points this paper is trying to address.

Detecting depression or anxiety symptoms in chronic disease patients
Optimizing large language models for symptom detection
Improving mental health care through timely screening and referrals
Innovation

Methods, ideas, or system contributions that make the work stand out.

Engineered prompts and systemic persona used
Zero-shot and few-shot learning applied
Temperature adjustments enhanced model performance
🔎 Similar Papers
No similar papers found.
J
Jiyeong Kim
Stanford Center for Digital Health, Department of Medicine, Stanford University, Stanford, CA, USA
S
Stephen P. Ma
Division of Hospital Medicine, Stanford School of Medicine, Stanford, CA; Technology and Digital Solutions, School of Medicine, Stanford University, Stanford, CA, USA; Stanford Health Care, Palo Alto, CA, USA
M
Michael L. Chen
Stanford Center for Digital Health, Department of Medicine, Stanford University, Stanford, CA, USA
Isaac R. Galatzer-Levy
Isaac R. Galatzer-Levy
Google DeepMind, NYU School of Medicine
Health Tech DevelopmentNeuroscienceAIDigital HealthPost-traumatic Stress and Resilence
John Torous
John Torous
Harvard Medical School / Beth Israel Deaconess Medical Center
Clinical InformaticsDigital PhenotypingSmartphonesMental HealthPsychiatry
P
Peter J. van Roessel
Department of Psychiatry and Behavioral Sciences, School of Medicine, Stanford University, Palo Alto, CA, USA
C
Christopher Sharp
Division of Hospital Medicine, Stanford School of Medicine, Stanford, CA; Technology and Digital Solutions, School of Medicine, Stanford University, Stanford, CA, USA; Stanford Health Care, Palo Alto, CA, USA
M
Michael A Pfeffer
Division of Hospital Medicine, Stanford School of Medicine, Stanford, CA; Technology and Digital Solutions, School of Medicine, Stanford University, Stanford, CA, USA; Stanford Health Care, Palo Alto, CA, USA
C
Carolyn I. Rodriguez
Department of Psychiatry and Behavioral Sciences, School of Medicine, Stanford University, Palo Alto, CA, USA; Veterans Affairs Palo Alto Health Care System, Palo Alto, CA, USA
Eleni Linos
Eleni Linos
Stanford University
EpidemiologyDermatologyPublic Health
J
Jonathan H. Chen
Division of Hospital Medicine, Stanford School of Medicine, Stanford, CA; Technology and Digital Solutions, School of Medicine, Stanford University, Stanford, CA, USA; Department of Biomedical Data Science, School of Medicine, Stanford University, Stanford, CA, USA