🤖 AI Summary
Over 36 million people globally suffer from substance use disorders (SUDs), yet treatment accessibility and efficacy remain severely limited by stigma, low patient motivation, and insufficient personalized support. To address this, we propose the first clinically aligned multi-agent dialogue framework for addiction recovery, integrating cognitive behavioral therapy (CBT) and motivational interviewing (MI). We construct a high-fidelity, resistance-graded synthetic benchmark and adopt a two-stage training paradigm—supervised fine-tuning followed by direct preference optimization—incorporating dynamic patient modeling, context-aware dialogue generation, and adaptive persuasive strategies. Experiments demonstrate that our system increases average patient treatment motivation by 41.5% and self-efficacy by 0.49 points, reduces resolution cycles for challenging cases by 26% compared to GPT-4o, and significantly outperforms existing models in empathy, responsiveness, and behavioral authenticity. Clinical validity is confirmed through both automated and expert clinician evaluations.
📝 Abstract
Substance use disorders (SUDs) affect over 36 million people worldwide, yet few receive effective care due to stigma, motivational barriers, and limited personalized support. Although large language models (LLMs) show promise for mental-health assistance, most systems lack tight integration with clinically validated strategies, reducing effectiveness in addiction recovery. We present ChatThero, a multi-agent conversational framework that couples dynamic patient modeling with context-sensitive therapeutic dialogue and adaptive persuasive strategies grounded in cognitive behavioral therapy (CBT) and motivational interviewing (MI). We build a high-fidelity synthetic benchmark spanning Easy, Medium, and Hard resistance levels, and train ChatThero with a two-stage pipeline comprising supervised fine-tuning (SFT) followed by direct preference optimization (DPO). In evaluation, ChatThero yields a 41.5% average gain in patient motivation, a 0.49% increase in treatment confidence, and resolves hard cases with 26% fewer turns than GPT-4o, and both automated and human clinical assessments rate it higher in empathy, responsiveness, and behavioral realism. The framework supports rigorous, privacy-preserving study of therapeutic conversation and provides a robust, replicable basis for research and clinical translation.