Leveraging AM and FM Rhythm Spectrograms for Dementia Classification and Assessment

📅 2025-06-01

📈 Citations: 0

✨ Influential: 0

career value

184K/year

🤖 AI Summary

This study addresses the challenge of modeling long-term rhythmic abnormalities in dementia-related speech. We propose Rhythmic Formant Analysis (RFA), a novel method that constructs AM/FM rhythmic spectrograms to explicitly characterize slow temporal modulations in speech signals. Complementing this, we design handcrafted rhythmic morphological features and introduce a ViT-BERT multimodal fusion paradigm to jointly model the visual structural patterns of rhythmic spectrograms and linguistic semantic information. Experimental results demonstrate that our handcrafted features improve classification accuracy by 14.2% over the eGeMAPs baseline. Moreover, the RFA-based spectrogram fusion achieves a 13.1% gain in classification performance compared to conventional Mel-spectrograms and attains state-of-the-art performance on dementia severity regression. This work establishes a new feature representation and a principled multimodal modeling framework for non-invasive, speech-based dementia screening.

Technology Category

Application Category

📝 Abstract

This study explores the potential of Rhythm Formant Analysis (RFA) to capture long-term temporal modulations in dementia speech. Specifically, we introduce RFA-derived rhythm spectrograms as novel features for dementia classification and regression tasks. We propose two methodologies: (1) handcrafted features derived from rhythm spectrograms, and (2) a data-driven fusion approach, integrating proposed RFA-derived rhythm spectrograms with vision transformer (ViT) for acoustic representations along with BERT-based linguistic embeddings. We compare these with existing features. Notably, our handcrafted features outperform eGeMAPs with a relative improvement of $14.2%$ in classification accuracy and comparable performance in the regression task. The fusion approach also shows improvement, with RFA spectrograms surpassing Mel spectrograms in classification by around a relative improvement of $13.1%$ and a comparable regression score with the baselines.

Problem

Research questions and friction points this paper is trying to address.

Develop RFA-derived rhythm spectrograms for dementia speech analysis

Compare handcrafted and fusion methods for dementia classification tasks

Improve classification accuracy using RFA spectrograms over existing features

Innovation

Methods, ideas, or system contributions that make the work stand out.

RFA-derived rhythm spectrograms for dementia classification

Handcrafted features from rhythm spectrograms outperform eGeMAPs

Fusion of RFA spectrograms with ViT and BERT embeddings

🔎 Similar Papers

Toward Robust Early Detection of Alzheimer's Disease via an Integrated Multimodal Learning Approach