Beyond Words: Measuring User Experience through Speech Analysis in Voice User Interfaces

📅 2026-03-20
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the limitations of conventional user experience (UX) evaluation methods for voice assistants, which predominantly rely on task completion rates and subjective questionnaires and thus fail to capture users’ genuine emotions and interaction difficulties in real time. To overcome this, the work proposes a novel implicit evaluation paradigm that requires no active user feedback by systematically investigating the relationship between paralinguistic vocal features and UX. The authors extract time-domain, spectral, and linguistic speech features and employ machine learning models to construct a voice-based UX classifier. Experimental results demonstrate that multiple vocal features are significantly correlated with user satisfaction, and the proposed model effectively differentiates among distinct UX levels with strong classification accuracy.

Technology Category

Application Category

📝 Abstract
Voice assistants (VAs) are typically evaluated through task performance metrics and self-report questionnaires, but people's voices themselves carry rich paralinguistic cues that reveal affect, effort, and interaction breakdowns. We present a within-subjects study (N=49) that systematically compared three VA personas across three usage scenarios to investigate whether speech-derived audio features can serve as a proxy for user experience (UX). Participants' speech was analyzed for temporal, spectral, and linguistic markers, alongside standardized UX measures, brief mood and stress ratings, and a post-study questionnaire. We found correlations between specific speech features and self-reported satisfaction and experience. Furthermore, a machine learning model trained on speech features achieved promising accuracy in classifying UX levels, indicating that this might be a reasonable alternative to self-report instruments. Our findings establish speech as a viable, real-time signal for implicitly measuring UX and point toward adaptive VUIs that respond dynamically to emotional and usability-related vocal cues.
Problem

Research questions and friction points this paper is trying to address.

user experience
speech analysis
voice user interfaces
paralinguistic cues
implicit measurement
Innovation

Methods, ideas, or system contributions that make the work stand out.

speech analysis
user experience
voice user interfaces
paralinguistic cues
implicit UX measurement
🔎 Similar Papers
No similar papers found.