Triple Spectral Fusion for Sensor-based Human Activity Recognition

📅 2026-05-04
📈 Citations: 0
Influential: 0
📄 PDF

career value

193K/year
🤖 AI Summary
This work addresses the challenges of fusing multi-source heterogeneous IMU data and insufficient long-term contextual modeling by proposing a tri-spectral fusion framework. The framework introduces adaptive filtering mechanisms in the Fourier, graph Fourier, and wavelet domains to effectively integrate pose, motion, and contextual information. It innovatively constructs a dynamic heterogeneous graph that combines adaptive complementary filtering, graph Fourier transforms, wavelet-based frequency selection, and timestamp-aware graph aggregation to suppress noise and redundancy while enhancing multimodal fusion and long-range dependency modeling. Evaluated on ten benchmark datasets, the proposed method significantly outperforms existing approaches, achieving state-of-the-art performance.
📝 Abstract
The field of sensor-based human activity recognition (HAR) mainly uses posture, motion and context data of Inertial Measurement Units (IMUs) to identify daily activities. Despite the advancements in learning-based methods, it is challenging to perform information fusion from the temporal perspective due to the complexities in fusing heterogeneous sensor data and establishing long-term context correlations. This paper proposes a novel triple spectral fusion framework tailored for HAR. First, we develop an adaptive complementary filtering technique for noise suppression and organize each IMU's sensors into posture and motion modality nodes. Given that IMU nodes form a dynamic heterogeneous graph, we then apply adaptive filtering within the graph Fourier domain to merge both homogeneous and heterogeneous node information. Furthermore, an adaptive wavelet frequency selection approach is implemented to suppress context redundancy and shorten the length of features. This approach enhances both timestamp-based graph aggregation and the correlation of long-term contexts. Our framework uses adaptive filtering in the Fourier, graph Fourier, and wavelet domains, enabling effective multi-sensor fusion and context correlation. Extensive experiments on ten benchmark datasets demonstrate the superior performance of our framework. Project page: https://github.com/crocodilegogogo/TSF-TPAMI2026.
Problem

Research questions and friction points this paper is trying to address.

human activity recognition
sensor fusion
heterogeneous data
temporal fusion
long-term context
Innovation

Methods, ideas, or system contributions that make the work stand out.

triple spectral fusion
adaptive filtering
graph Fourier transform
wavelet frequency selection
heterogeneous sensor fusion