Figurative-cum-Commonsense Knowledge Infusion for Multimodal Mental Health Meme Classification

📅 2025-01-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Machines struggle to interpret non-literal expressions—such as irony and metaphor—in mental health–related memes, hindering fine-grained detection of anxiety symptoms. Method: We propose M3H, the first multimodal framework for granular anxiety symptom identification, built upon a novel, GAD-scale–aligned anxiety meme dataset (AxiOM). M3H integrates ConceptNet–enhanced commonsense knowledge injection, clinical-adapted vision–language alignment, and metaphor-aware attentional fusion. Contribution/Results: On AxiOM, M3H achieves 4.20–4.66 percentage points improvement in weighted F1-score; cross-dataset evaluation on RESTORE confirms strong generalizability. Human-centered evaluation demonstrates significantly improved metaphor intention understanding, and ablation studies quantify individual module contributions. This work establishes the first commonsense-augmented modeling paradigm for mental health metaphor interpretation, advancing social media psychological risk detection toward deep semantic understanding.

Technology Category

Application Category

📝 Abstract
The expression of mental health symptoms through non-traditional means, such as memes, has gained remarkable attention over the past few years, with users often highlighting their mental health struggles through figurative intricacies within memes. While humans rely on commonsense knowledge to interpret these complex expressions, current Multimodal Language Models (MLMs) struggle to capture these figurative aspects inherent in memes. To address this gap, we introduce a novel dataset, AxiOM, derived from the GAD anxiety questionnaire, which categorizes memes into six fine-grained anxiety symptoms. Next, we propose a commonsense and domain-enriched framework, M3H, to enhance MLMs' ability to interpret figurative language and commonsense knowledge. The overarching goal remains to first understand and then classify the mental health symptoms expressed in memes. We benchmark M3H against 6 competitive baselines (with 20 variations), demonstrating improvements in both quantitative and qualitative metrics, including a detailed human evaluation. We observe a clear improvement of 4.20% and 4.66% on weighted-F1 metric. To assess the generalizability, we perform extensive experiments on a public dataset, RESTORE, for depressive symptom identification, presenting an extensive ablation study that highlights the contribution of each module in both datasets. Our findings reveal limitations in existing models and the advantage of employing commonsense to enhance figurative understanding.
Problem

Research questions and friction points this paper is trying to address.

Mental Health
Internet Memes
Machine Understanding
Innovation

Methods, ideas, or system contributions that make the work stand out.

AxiOM Dataset
M3H Methodology
Mental Health Meme Classification
🔎 Similar Papers
No similar papers found.
Abdullah Mazhar
Abdullah Mazhar
IIIT Delhi
NLPResponsible AIHealthcare
Zuhair Hasan Shaik
Zuhair Hasan Shaik
MBZUAI | ex - (MSRI, IIIT Dharwad)
Social ComputingResponsible AIInterpretability
Aseem Srivastava
Aseem Srivastava
MBZUAI
Large Language ModelsSocial InteractionsDigital Health
P
Polly Ruhnke
University of Illinois, Chicago, USA
L
Lavanya Vaddavalli
University of Illinois, Chicago, USA
S
Sri Keshav Katragadda
University of Illinois, Chicago, USA
S
Shweta Yadav
University of Illinois, Chicago, USA
Md Shad Akhtar
Md Shad Akhtar
IIIT Delhi
NLPConversational DialogMental-HealthMisinformationCode-mixed Languages