Balanced Representation Learning for Long-tailed Skeleton-based Action Recognition

📅 2023-08-27
🏛️ Machine Intelligence Research
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the long-tailed distribution problem in skeleton-based action recognition, this paper proposes a decoupled representation learning framework that jointly optimizes class balance in both feature space and classifier space. The method innovatively integrates representation disentanglement with dynamic class reweighting, incorporating prototype alignment regularization and tail-class-aware contrastive learning to effectively mitigate feature shift and classifier bias. Built upon graph convolutional networks (GCNs), it employs a prototype memory bank, dynamic label smoothing, and class-aware feature reweighting. Evaluated on the NTU-60 and NTU-120 long-tailed benchmarks, the approach achieves an overall accuracy improvement of 5.2% and a substantial 12.7% gain in tail-class accuracy, surpassing existing state-of-the-art methods.
Problem

Research questions and friction points this paper is trying to address.

Addresses class imbalance in action recognition
Improves representation learning for tail classes
Enhances generalization in skeleton-based datasets
Innovation

Methods, ideas, or system contributions that make the work stand out.

Spatial-temporal action exploration strategy
Detached action-aware learning schedule
Skip-modal representation for structural information
🔎 Similar Papers
No similar papers found.
Hongda Liu
Hongda Liu
Sun Yat-sen University
Computer VisionLow-level VisionImage RestorationStyle Transfer
Y
Yunlong Wang
Center for Research on Intelligent Perception and Computing, State Key Laboratory of Multimodal Artificial Intelligence Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
Min Ren
Min Ren
Continental Advanced Lidar Solutions US, LLC
PhotonicsAvalanche PhotodiodesSingle Photon DectectorsSingle Photon DetectionLidar
Junxing Hu
Junxing Hu
University of Chinese Academy of Sciences
AI AgentComputer Vision3D VisionBiometrics
Z
Zhengquan Luo
University of Science and Technology of China, Hefei 230027, China, and also with the Center for Research on Intelligent Perception and Computing, State Key Laboratory of Multimodal Artificial Intelligence Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
G
Guangqi Hou
Center for Research on Intelligent Perception and Computing, State Key Laboratory of Multimodal Artificial Intelligence Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
Zhenan Sun
Zhenan Sun
Institute of Automation, Chinese Academy of Sciences
BiometricsPattern RecognitionComputer Vision