Multi Class Parkinsons Disease Detection Based on Finger Tapping Using Attention-Enhanced CNN BiLSTM

📅 2025-10-11
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the challenge of non-invasive, accurate staging of Parkinson’s disease (PD) severity, this paper proposes an end-to-end multi-level classification method based on finger-tapping videos. We extract time-frequency-amplitude multimodal motion features and design a hybrid deep learning model integrating Conv1D, bidirectional LSTM (BiLSTM), and a dual-channel–temporal attention mechanism to jointly model spatiotemporal dependencies and enhance discriminative representation of critical temporal segments. Evaluated on the five-class Hoehn–Yahr staging task, our model significantly outperforms existing gesture-based PD assessment methods. To the best of our knowledge, this is the first work to apply a dual-attention–guided CNN-BiLSTM architecture for fine-grained PD severity grading. We empirically demonstrate that synergistic spatiotemporal feature modeling and attention-driven focus on clinically salient frames substantially improve both classification accuracy and clinical interpretability. This study establishes a novel paradigm for remote, unobtrusive monitoring of PD progression.

Technology Category

Application Category

📝 Abstract
Effective clinical management and intervention development depend on accurate evaluation of Parkinsons disease (PD) severity. Many researchers have worked on developing gesture-based PD recognition systems; however, their performance accuracy is not satisfactory. In this study, we propose a multi-class Parkinson Disease detection system based on finger tapping using an attention-enhanced CNN BiLSTM. We collected finger tapping videos and derived temporal, frequency, and amplitude based features from wrist and hand movements. Then, we proposed a hybrid deep learning framework integrating CNN, BiLSTM, and attention mechanisms for multi-class PD severity classification from video-derived motion features. First, the input sequence is reshaped and passed through a Conv1D MaxPooling block to capture local spatial dependencies. The resulting feature maps are fed into a BiLSTM layer to model temporal dynamics. An attention mechanism focuses on the most informative temporal features, producing a context vector that is further processed by a second BiLSTM layer. CNN-derived features and attention-enhanced BiLSTM outputs are concatenated, followed by dense and dropout layers, before the final softmax classifier outputs the predicted PD severity level. The model demonstrated strong performance in distinguishing between the five severity classes, suggesting that integrating spatial temporal representations with attention mechanisms can improve automated PD severity detection, making it a promising non-invasive tool to support clinicians in PD monitoring and progression tracking.
Problem

Research questions and friction points this paper is trying to address.

Developing accurate multi-class Parkinson's disease severity detection system
Classifying PD severity using finger tapping videos with deep learning
Improving automated PD assessment through spatial-temporal feature integration
Innovation

Methods, ideas, or system contributions that make the work stand out.

Attention-enhanced CNN BiLSTM for PD detection
Hybrid deep learning with spatial-temporal feature extraction
Video-derived motion features with multi-class classification
🔎 Similar Papers
No similar papers found.
A
Abu Saleh Musa Miah
School of Computer Science and Engineering, University of Aizu, Fukushima, Japan
M
Md Maruf Al Hossain
Department of Computer Science and Engineering, Army University of Science and Technology, Nilphamari, Bangladesh
Najmul Hassan
Najmul Hassan
University of Oregon
Computer VisionDeep Learning
Y
Yuichi Okuyama
School of Computer Science and Engineering, University of Aizu, Fukushima, Japan
J
Jungpil Shin
School of Computer Science and Engineering, University of Aizu, Fukushima, Japan