Improving skeleton-based action recognition with interactive object information

๐Ÿ“… 2025-01-07
๐Ÿ›๏ธ International Journal of Multimedia Information Retrieval
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
To address the lack of object-interaction semantic modeling in skeleton-based action recognition, this paper proposes an object-aware end-to-end action understanding framework. The core method introduces learnable and optimizable object representations to explicitly model spatio-temporal associations between human joints and scene objects. It incorporates an attention-driven interaction modeling mechanism, a multimodal feature fusion module, and a differentiable object-relation reasoning moduleโ€”marking the first integration of explicit object information into graph convolutional network (GCN)-based skeleton action recognition architectures. Evaluated on NTU-60 and NTU-120 benchmarks, the approach achieves absolute accuracy improvements of 2.3% and 1.9%, respectively, significantly outperforming skeleton-only baselines. These results empirically validate the critical contribution of object-interaction semantics to action recognition performance.

Technology Category

Application Category

Problem

Research questions and friction points this paper is trying to address.

Action Recognition
Skeleton-Based
Object-Related Actions
Innovation

Methods, ideas, or system contributions that make the work stand out.

ST-VGCN Network
Item Node Integration
Enhanced Action Recognition