🤖 AI Summary
To address the clinical need for non-invasive, interpretable early screening of Parkinson’s disease (PD), this study proposes a speech-driven deep learning framework. First, multidimensional acoustic features—including MFCCs, jitter, and shimmer—are extracted from sustained phonations. Second, an end-to-end classification model is built, integrated with an attention-guided attribution mechanism to localize discriminative phonatory segments at the utterance level. Third, attribution outputs are mapped onto clinically meaningful dysarthria phenotypes—such as delayed voice onset and tremulous vocal fold oscillation—to infer underlying neuromuscular impairments. To our knowledge, this is the first approach that simultaneously achieves state-of-the-art classification accuracy (831 samples from the Italian PD Speech Database) and fine-grained, clinically interpretable, segment-level explainability. The framework enhances model transparency and diagnostic utility for clinical decision support.
📝 Abstract
Parkinson's disease is a progressive neurodegenerative disorder affecting motor and non-motor functions, with speech impairments among its earliest symptoms. Speech impairments offer a valuable diagnostic opportunity, with machine learning advances providing promising tools for timely detection. In this research, we propose a deep learning approach for early Parkinson's disease detection from speech recordings, which also highlights the vocal segments driving predictions to enhance interpretability. This approach seeks to associate predictive speech patterns with articulatory features, providing a basis for interpreting underlying neuromuscular impairments. We evaluated our approach using the Italian Parkinson's Voice and Speech Database, containing 831 audio recordings from 65 participants, including both healthy individuals and patients. Our approach showed competitive classification performance compared to state-of-the-art methods, while providing enhanced interpretability by identifying key speech features influencing predictions.