IDRL: An Individual-Aware Multimodal Depression-Related Representation Learning Framework for Depression Diagnosis

📅 2026-03-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Multimodal depression detection faces significant challenges due to modality inconsistency, interference from irrelevant information, and inter-individual variability, which hinder effective fusion. To address these issues, this work proposes the IDRL framework, which uniquely integrates modality alignment and individual differences within a unified model. Specifically, it disentangles multimodal representations into three distinct subspaces: a shared depression space, modality-specific depression spaces, and an irrelevant space. Furthermore, an Individual-Aware Fusion (IAF) module is introduced to dynamically adjust modality weights according to individual characteristics, enabling adaptive, person-specific fusion. This approach substantially enhances the extraction of depression-relevant signals and improves fusion robustness, achieving consistently superior and stable performance over existing methods in multimodal depression detection tasks.

Technology Category

Application Category

📝 Abstract
Depression is a severe mental disorder, and reliable identification plays a critical role in early intervention and treatment. Multimodal depression detection aims to improve diagnostic performance by jointly modeling complementary information from multiple modalities. Recently, numerous multimodal learning approaches have been proposed for depression analysis; however, these methods suffer from the following limitations: 1) inter-modal inconsistency and depression-unrelated interference, where depression-related cues may conflict across modalities while substantial irrelevant content obscures critical depressive signals, and 2) diverse individual depressive presentations, leading to individual differences in modality and cue importance that hinder reliable fusion. To address these issues, we propose Individual-aware Multimodal Depression-related Representation Learning Framework (IDRL) for robust depression diagnosis. Specifically, IDRL 1) disentangles multimodal representations into a modality-common depression space, a modality-specific depression space, and a depression-unrelated space to enhance modality alignment while suppressing irrelevant information, and 2) introduces an individual-aware modality-fusion module (IAF) that dynamically adjusts the weights of disentangled depression-related features based on their predictive significance, thereby achieving adaptive cross-modal fusion for different individuals. Extensive experiments demonstrate that IDRL achieves superior and robust performance for multimodal depression detection.
Problem

Research questions and friction points this paper is trying to address.

multimodal depression detection
inter-modal inconsistency
depression-unrelated interference
individual differences
modality fusion
Innovation

Methods, ideas, or system contributions that make the work stand out.

multimodal representation learning
disentangled representation
individual-aware fusion
depression diagnosis
modality alignment
🔎 Similar Papers
No similar papers found.
C
Chongxiao Wang
School of Computer Science and Engineering, Northeastern University, Shenyang, China; Key Laboratory of Intelligent Computing in Medical Image of Ministry of Education, Northeastern University, Shenyang, China
J
Junjie Liang
School of Computer Science and Engineering, Northeastern University, Shenyang, China; Key Laboratory of Intelligent Computing in Medical Image of Ministry of Education, Northeastern University, Shenyang, China
Peng Cao
Peng Cao
Northeastern University
Data miningMachine learninig
J
Jinzhu Yang
School of Computer Science and Engineering, Northeastern University, Shenyang, China; Key Laboratory of Intelligent Computing in Medical Image of Ministry of Education, Northeastern University, Shenyang, China; National Frontiers Science Center for Industrial Intelligence and Systems Optimization, Shenyang, China
O
Osmar R. Zaiane
Alberta Machine Intelligence Institute, University of Alberta, Edmonton, Canada