Developing Conversational Speech Systems for Robots to Detect Speech Biomarkers of Cognition in People Living with Dementia

πŸ“… 2025-02-15
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Real-time assessment of speech-based biomarkers for early dementia screening remains challenging in clinical practice. Method: This study introduces the first clinically oriented conversational speech analysis robot system, integrating a fine-tuned domain-specific large language model (dementia-LLM) with a low-latency WebSocket architecture. The frontend employs a Progressive Web App (PWA), while the backend combines speech feature engineering with six-dimensional cognitive-relevant biomarker modeling, culminating in a Composite Voice-Based Score (CVBS). Contribution/Results: The system achieves end-to-end, real-time (response latency <1.5 s), and interpretable quantification of cognitive statusβ€”the first of its kind. On the DementiaBank corpus, CVBS demonstrates moderate correlation with the Mini-Mental State Examination (MMSE; r = 0.58), significantly outperforming individual speech biomarkers. Additionally, the system supports automated, clinician-friendly visualization reports, underscoring its readiness for clinical deployment.

Technology Category

Application Category

πŸ“ Abstract
This study presents the development and testing of a conversational speech system designed for robots to detect speech biomarkers indicative of cognitive impairments in people living with dementia (PLwD). The system integrates a backend Python WebSocket server and a central core module with a large language model (LLM) fine-tuned for dementia to process user input and generate robotic conversation responses in real-time in less than 1.5 seconds. The frontend user interface, a Progressive Web App (PWA), displays information and biomarker score graphs on a smartphone in real-time to human users (PLwD, caregivers, clinicians). Six speech biomarkers based on the existing literature - Altered Grammar, Pragmatic Impairments, Anomia, Disrupted Turn-Taking, Slurred Pronunciation, and Prosody Changes - were developed for the robot conversation system using two datasets, one that included conversations of PLwD with a human clinician (DementiaBank dataset) and one that included conversations of PLwD with a robot (Indiana dataset). We also created a composite speech biomarker that combined all six individual biomarkers into a single score. The speech system's performance was first evaluated on the DementiaBank dataset showing moderate correlation with MMSE scores, with the composite biomarker score outperforming individual biomarkers. Analysis of the Indiana dataset revealed higher and more variable biomarker scores, suggesting potential differences due to study populations (e.g. severity of dementia) and the conversational scenario (human-robot conversations are different from human-human). The findings underscore the need for further research on the impact of conversational scenarios on speech biomarkers and the potential clinical applications of robotic speech systems.
Problem

Research questions and friction points this paper is trying to address.

Detect cognitive impairments in dementia
Develop conversational speech system for robots
Evaluate speech biomarkers in human-robot interactions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Python WebSocket server integration
LLM fine-tuned for dementia
Real-time PWA biomarker display
πŸ”Ž Similar Papers
No similar papers found.
Y
Young-Ho Bae
Department of Data Science, Hanyang University, Seoul, Korea; AI Technology Center, POSCO DX, Pohang, Korea
D
Diego Izaguirre
Department of Computing & Digital Media, DePaul University, Chicago, IL USA
E
Esther Hwang
Department of Computing & Digital Media, DePaul University, Chicago, IL USA
A
Andrew Murphy
Department of Computing & Digital Media, DePaul University, Chicago, IL USA
Long-Jing Hsu
Long-Jing Hsu
University of Michigan - Ann Arbor
Human-Robot InteractionHCIOlder AdultsPeople Living with DementiaInformal Caregivers
S
Selma Sabanovic
School of Informatics, Computing, & Engineering, Indiana University, Bloomington IN, USA
C
Casey C. Bennett
Department of Computing & Digital Media, DePaul University, Chicago, IL USA