Estimating Quality in Therapeutic Conversations: A Multi-Dimensional Natural Language Processing Framework

📅 2025-05-09
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the lack of objective assessment tools for client engagement quality in therapeutic dialogues. We propose the first NLP evaluation framework integrating four dimensions: dialogue dynamics, semantic alignment, affective expression, and question identification. For the first time, we systematically quantify 42 linguistic features—revealing that statistical properties (mean and standard deviation) of client utterances are the most predictive indicators of engagement quality. Using Random Forest, CatBoost, and SVM classifiers, we employ stratified 5-fold cross-validation with SMOTE-Tomek hybrid oversampling/undersampling on clinical counseling transcripts. The framework achieves 88.9% accuracy, 90.0% F1-score, and 94.6% AUC. It significantly outperforms baseline models in F1-score and recall, demonstrates potential for real-time clinical feedback, and is extensible to multimodal inputs—including speech prosody and facial expressions.

Technology Category

Application Category

📝 Abstract
Engagement between client and therapist is a critical determinant of therapeutic success. We propose a multi-dimensional natural language processing (NLP) framework that objectively classifies engagement quality in counseling sessions based on textual transcripts. Using 253 motivational interviewing transcripts (150 high-quality, 103 low-quality), we extracted 42 features across four domains: conversational dynamics, semantic similarity as topic alignment, sentiment classification, and question detection. Classifiers, including Random Forest (RF), Cat-Boost, and Support Vector Machines (SVM), were hyperparameter tuned and trained using a stratified 5-fold cross-validation and evaluated on a holdout test set. On balanced (non-augmented) data, RF achieved the highest classification accuracy (76.7%), and SVM achieved the highest AUC (85.4%). After SMOTE-Tomek augmentation, performance improved significantly: RF achieved up to 88.9% accuracy, 90.0% F1-score, and 94.6% AUC, while SVM reached 81.1% accuracy, 83.1% F1-score, and 93.6% AUC. The augmented data results reflect the potential of the framework in future larger-scale applications. Feature contribution revealed conversational dynamics and semantic similarity between clients and therapists were among the top contributors, led by words uttered by the client (mean and standard deviation). The framework was robust across the original and augmented datasets and demonstrated consistent improvements in F1 scores and recall. While currently text-based, the framework supports future multimodal extensions (e.g., vocal tone, facial affect) for more holistic assessments. This work introduces a scalable, data-driven method for evaluating engagement quality of the therapy session, offering clinicians real-time feedback to enhance the quality of both virtual and in-person therapeutic interactions.
Problem

Research questions and friction points this paper is trying to address.

Classifying engagement quality in therapy sessions using NLP
Evaluating conversational dynamics and semantic similarity features
Providing real-time feedback to improve therapeutic interactions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-dimensional NLP framework for therapy engagement
Feature extraction across four conversational domains
Enhanced classifier performance with SMOTE-Tomek augmentation
A
Alice Rueda
St. Michael’s Hospital, Unity Health Toronto
A
Argyrios Perivolaris
St. Michael’s Hospital, Unity Health Toronto
N
Niloy Roy
Toronto Metropolitan University
D
Dylan Weston
Toronto Metropolitan University
S
Sarmed Shaya
Toronto Metropolitan University
Z
Zachary Cote
Toronto Metropolitan University
M
Martin Ivanov
St. Michael’s Hospital, Unity Health Toronto
B
Bazen Gashaw Teferra
St. Michael’s Hospital, Unity Health Toronto
Yuqi Wu
Yuqi Wu
PhD , University of Alberta, Fudan University
Natural Language ProcessingComputational PsychiatryLarge Language Models
Sirisha Rambhatla
Sirisha Rambhatla
Assistant Professor at the University of Waterloo
Machine LearningStatistical Signal ProcessingOptimizationAI for Healthcare
Divya Sharma
Divya Sharma
York University
A
Andrew J Greenshaw
University of Alberta
R
Rakesh Jetly
Institute of Mental Health Research, University of Ottawa
Y
Yanbo Zhang
University of Alberta
B
Bo Cao
University of Alberta
Reza Samavi
Reza Samavi
Associate Professor, Toronto Metropoiltan University
Security and PrivacyMachine Learning
S
Sri Krishnan
Toronto Metropolitan University
V
Venkat Bhat
St. Michael’s Hospital, Unity Health Toronto