Unveiling the Landscape of Clinical Depression Assessment: From Behavioral Signatures to Psychiatric Reasoning

📅 2025-08-06
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing depression assessment methods rely heavily on non-clinical data and employ complex models with limited clinical deployability. Method: This work proposes a clinically grounded, multimodal depression assessment framework tailored to real-world diagnostic settings. We introduce the C-MIND dataset—comprising synchronized audio, video, transcribed text, and functional near-infrared spectroscopy (fNIRS) neuroimaging signals—and design a clinical-knowledge-guided large language model (LLM) inference mechanism that jointly integrates behavioral analysis and psychiatric diagnostic reasoning. Furthermore, we systematically quantify the contribution of each modality to the diagnostic task. Results: Evaluated on authentic clinical data, our approach achieves a 10% improvement in Macro-F1 score over prior methods, while significantly enhancing model interpretability and practical deployability. To our knowledge, this is the first automated depression assessment framework grounded in real clinical workflows, balancing diagnostic reliability with clinical utility.

Technology Category

Application Category

📝 Abstract
Depression is a widespread mental disorder that affects millions worldwide. While automated depression assessment shows promise, most studies rely on limited or non-clinically validated data, and often prioritize complex model design over real-world effectiveness. In this paper, we aim to unveil the landscape of clinical depression assessment. We introduce C-MIND, a clinical neuropsychiatric multimodal diagnosis dataset collected over two years from real hospital visits. Each participant completes three structured psychiatric tasks and receives a final diagnosis from expert clinicians, with informative audio, video, transcript, and functional near-infrared spectroscopy (fNIRS) signals recorded. Using C-MIND, we first analyze behavioral signatures relevant to diagnosis. We train a range of classical models to quantify how different tasks and modalities contribute to diagnostic performance, and dissect the effectiveness of their combinations. We then explore whether LLMs can perform psychiatric reasoning like clinicians and identify their clear limitations in realistic clinical settings. In response, we propose to guide the reasoning process with clinical expertise and consistently improves LLM diagnostic performance by up to 10% in Macro-F1 score. We aim to build an infrastructure for clinical depression assessment from both data and algorithmic perspectives, enabling C-MIND to facilitate grounded and reliable research for mental healthcare.
Problem

Research questions and friction points this paper is trying to address.

Improving automated depression assessment with clinically validated data
Analyzing behavioral signatures and multimodal data for diagnosis
Enhancing LLM psychiatric reasoning using clinical expertise guidance
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multimodal dataset with fNIRS and behavioral data
Clinical expertise-guided LLM reasoning enhancement
Task-modality combination effectiveness analysis
🔎 Similar Papers
No similar papers found.
Zhuang Chen
Zhuang Chen
中南大学计算机学院
Natural Language ProcessingSocial IntelligenceComputational Psychology
Guanqun Bi
Guanqun Bi
Tsinghua University; UCAS
Social AgentsNatural Language Generation
W
Wen Zhang
University of International Relations
Jiawei Hu
Jiawei Hu
PhD Student, University of New South Wales
Mobile ComputingUbiquitous Computing
A
Aoyun Wang
School of Computer Science and Engineering, Central South University
X
Xiyao Xiao
Lingxin AI
Kun Feng
Kun Feng
Illinois Institute of Technology
M
Minlie Huang
CoAI Group, DCST, IAI, BNRIST, Tsinghua University