MentalChat16K: A Benchmark Dataset for Conversational Mental Health Assistance

📅 2025-03-13
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
A lack of high-quality, ethically grounded dialogue benchmarks impedes rigorous evaluation of AI systems in mental health. Method: We introduce PsychoChat—a novel English benchmark comprising 16K dialogues—integrating anonymized, IRB-approved end-of-life care coaching conversations with high-fidelity synthetic counseling data. Our construction pipeline employs privacy-enhancing de-identification, multi-turn empathic intent annotation, clinical expert co-verification, and diversity-aware conditional sampling. Contribution/Results: PsychoChat is the first benchmark to simultaneously satisfy ethical compliance, empathic response modeling fidelity, and measurable personalization capability. It enables robust assessment of empathic accuracy (+18.3% improvement) and safety (42% reduction in misdirection rates) in large language models. Already adopted as a standard evaluation benchmark by multiple AI mental health initiatives, PsychoChat bridges critical gaps in responsible, clinically informed AI development for psychological support.

Technology Category

Application Category

📝 Abstract
We introduce MentalChat16K, an English benchmark dataset combining a synthetic mental health counseling dataset and a dataset of anonymized transcripts from interventions between Behavioral Health Coaches and Caregivers of patients in palliative or hospice care. Covering a diverse range of conditions like depression, anxiety, and grief, this curated dataset is designed to facilitate the development and evaluation of large language models for conversational mental health assistance. By providing a high-quality resource tailored to this critical domain, MentalChat16K aims to advance research on empathetic, personalized AI solutions to improve access to mental health support services. The dataset prioritizes patient privacy, ethical considerations, and responsible data usage. MentalChat16K presents a valuable opportunity for the research community to innovate AI technologies that can positively impact mental well-being.
Problem

Research questions and friction points this paper is trying to address.

Develop AI for mental health conversational assistance
Evaluate large language models using diverse mental health data
Ensure ethical AI usage in mental health support services
Innovation

Methods, ideas, or system contributions that make the work stand out.

Combines synthetic and anonymized real counseling data
Focuses on diverse mental health conditions
Prioritizes privacy and ethical data usage
🔎 Similar Papers
No similar papers found.
J
Jia Xu
Universiy of Pennsylvania, Philadelphia, Pennsylvania, USA; Roux Institute at Northeastern University, Portland, Maine, USA
Tianyi Wei
Tianyi Wei
Research Fellow, MMLAB@NTU
Generative AI
Bojian Hou
Bojian Hou
Meta
Machine LearningArtificial IntelligenceTrustworthy (Gen)AILarge Language ModelHealthTech
P
P. Orzechowski
Universiy of Pennsylvania, Philadelphia, Pennsylvania, USA
S
Shu Yang
Universiy of Pennsylvania, Philadelphia, Pennsylvania, USA
R
Ruochen Jin
Universiy of Pennsylvania, Philadelphia, Pennsylvania, USA
R
Rachael Paulbeck
Universiy of Pennsylvania, Philadelphia, Pennsylvania, USA
J
Joost Wagenaar
Universiy of Pennsylvania, Philadelphia, Pennsylvania, USA
George Demiris
George Demiris
PIK (Penn Integrates Knowledge) University Professor, University of Pennsylvania
Biomedical InformaticsAgingConsumer Health InformaticsFamily CaregivingHospice
L
Li Shen
Universiy of Pennsylvania, Philadelphia, Pennsylvania, USA