Towards an End-To-End System for Real-Time Gesture Recognition from Surface Vibrations

📅 2026-05-11
📈 Citations: 0
Influential: 0
📄 PDF

career value

218K/year
🤖 AI Summary
This work proposes the first end-to-end real-time framework for surface vibration-based gesture recognition, addressing the limitation of existing approaches that focus only on isolated components. The system integrates piezoelectric sensing, configurable signal preprocessing, and lightweight model training into a unified pipeline. A modular data processing chain is established using bandpass filtering, fixed-length windowing, and min–max normalization, followed by a depthwise separable one-dimensional convolutional neural network containing only 8,722 parameters. Evaluated on a dataset of six gestures collected from 15 participants, the framework achieves consistently high accuracy across multiple partitioning strategies, with particularly strong performance in user-independent leave-one-subject-out cross-validation, thereby demonstrating the effectiveness of jointly optimizing preprocessing steps and model hyperparameters.
📝 Abstract
Sensing surface vibrations promise unobtrusive interaction for smart home systems by enabling gesture recognition on existing everyday surfaces without disturbing living-space design. Existing approaches typically address only parts of the processing chain, such as sensing hardware or offline gesture recognition, rather than providing an end-to-end system from surface-mounted sensors to the evaluation of the prediction model. This paper presents a custom sensor system and a configurable data-to-model pipeline for gesture recognition on a standard office desk. Our hardware enables a low-noise sensing of the vibrations using piezoelectric sensors. Building on a modular signal-processing framework, we model the full chain from continuous recordings through variable pre-processing to a model-ready dataset, and process the resulting data with compact depthwise separable 1D-CNNs. We conduct a joint search over pre-processing and model hyperparameters and identify a configuration with 8,722 parameters that uses band-pass filtering, fixed-length windows, and min-max normalization. On a self-recorded dataset with 15 participants performing six gestures this configuration achieves high accuracies across different data splitting methods, including strong user-independent performance in a leave-one-subject-out cross-validation.
Problem

Research questions and friction points this paper is trying to address.

gesture recognition
surface vibrations
end-to-end system
real-time interaction
smart home
Innovation

Methods, ideas, or system contributions that make the work stand out.

end-to-end system
surface vibration sensing
depthwise separable 1D-CNN
gesture recognition
modular signal processing
🔎 Similar Papers
No similar papers found.
F
Florian Hettstedt
Intelligent Embedded Systems Laboratory, University of Duisburg-Essen, Germany; PALUNO, The Ruhr Institute for Software Technology, Essen, Germany
C
Cedric Giese
Intelligent Embedded Systems Laboratory, University of Duisburg-Essen, Germany
T
Tianheng Ling
Intelligent Embedded Systems Laboratory, University of Duisburg-Essen, Germany; PALUNO, The Ruhr Institute for Software Technology, Essen, Germany
K
Keiichi Yasumoto
Ubiquitous Computing Systems Laboratory, Nara Institute of Science and Technology, Nara, Japan; RIKEN, Center for Advanced Intelligence Project, Tokyo, Japan
Gregor Schiele
Gregor Schiele
Professor of Computer Science (Embedded Systems), University Duisburg-Essen, Germany
embedded AIIoTembedded softwareadaptive SW and reconfigurable HW
A
Andreas Erbslöh
Intelligent Embedded Systems Laboratory, University of Duisburg-Essen, Germany; PALUNO, The Ruhr Institute for Software Technology, Essen, Germany