Joint angle model based learning to refine kinematic human pose estimation

📅 2025-07-15
🤖 AI Summary
Markerless human pose estimation (HPE) suffers from keypoint misidentification and trajectory jitter, and existing deep learning refinement models are limited by the noise in manually annotated ground truth. To address these issues, this paper proposes a joint-angle-based pose refinement framework. First, a geometrically consistent joint-angle representation is constructed, and the temporal variation of each joint angle is fitted with a high-order Fourier series to generate reliable synthetic ground truth. Second, a bidirectional recurrent neural network is trained on this data as a post-processing module that refines HRNet’s output in space and time. By removing the reliance on error-prone manual annotations, the method corrects misrecognized keypoints and smooths their trajectories, particularly for dynamic, complex motions such as figure skating and breakdancing. Experiments show that it outperforms the state-of-the-art HPE refinement network in these challenging cases.

📝 Abstract
Marker-free human pose estimation (HPE) is finding increasing application in various fields. When analyzing kinematic human poses, current HPE suffers from occasional errors in keypoint recognition and random fluctuations in keypoint trajectories. The performance of existing deep learning-based models for HPE refinement is considerably limited by inaccurate training datasets in which the keypoints are manually annotated. This paper proposes a novel method that overcomes this difficulty through joint angle-based modeling. The key techniques are: (i) a joint angle-based model of human pose that robustly describes kinematic human poses; (ii) approximation of the temporal variation of joint angles by high-order Fourier series to obtain reliable "ground truth"; and (iii) a bidirectional recurrent network, designed as a post-processing module, that refines the estimates of the well-established HRNet. Trained on the high-quality dataset constructed with this method, the network corrects wrongly recognized joints and smooths their spatiotemporal trajectories. Tests show that joint angle-based refinement (JAR) outperforms the state-of-the-art HPE refinement network in challenging cases such as figure skating and breaking.
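To make technique (i) concrete, the sketch below shows one plausible way to express a 2D keypoint chain (for example hip, knee, ankle) as segment lengths and angles, and to rebuild the keypoints from them. The abstract does not specify the paper's actual joint-angle model, so the function names `to_angles`, `to_keypoints`, and `joint_angle`, the 2D setting, and the use of absolute segment angles are all assumptions made for illustration.

```python
import math

def to_angles(chain):
    """Convert a 2D keypoint chain into segment lengths and
    absolute segment angles (radians, measured from the x-axis).
    Hypothetical helper, not the paper's formulation."""
    lengths, angles = [], []
    for (x0, y0), (x1, y1) in zip(chain, chain[1:]):
        dx, dy = x1 - x0, y1 - y0
        lengths.append(math.hypot(dx, dy))
        angles.append(math.atan2(dy, dx))
    return lengths, angles

def to_keypoints(root, lengths, angles):
    """Forward kinematics: rebuild the chain starting from the root keypoint."""
    points = [root]
    x, y = root
    for length, angle in zip(lengths, angles):
        x += length * math.cos(angle)
        y += length * math.sin(angle)
        points.append((x, y))
    return points

def joint_angle(a, b, c):
    """Unsigned angle at joint b between segments b->a and b->c."""
    v1 = (a[0] - b[0], a[1] - b[1])
    v2 = (c[0] - b[0], c[1] - b[1])
    cross = v1[0] * v2[1] - v1[1] * v2[0]
    dot = v1[0] * v2[0] + v1[1] * v2[1]
    return abs(math.atan2(cross, dot))
```

In a representation of this kind, jitter in a single keypoint shows up as a change in one or two angles while segment lengths stay fixed, which is one reason an angle-based description can be easier to smooth over time than raw pixel coordinates.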
Problem

Research questions and friction points this paper is trying to address.

Improves keypoint recognition in marker-free human pose estimation
Reduces random fluctuations in keypoint trajectories
Overcomes limitations from inaccurate manually annotated datasets
Innovation

Methods, ideas, or system contributions that make the work stand out.

Joint angle-based model for robust pose description
High order Fourier series for reliable ground truth
Bidirectional recurrent network refining HRNet estimations
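The Fourier-series idea can be sketched as follows. This is a minimal illustration that assumes the joint-angle samples are uniformly spaced in time and cover exactly one period of the motion; under that assumption the least-squares coefficients reduce to simple discrete projections. The names `fit_fourier` and `evaluate`, the periodicity assumption, and the choice of order are hypothetical and are not taken from the paper.

```python
import math

def fit_fourier(samples, order):
    """Least-squares Fourier fit of uniformly spaced samples that are
    assumed to span exactly one period.
    Returns (a0, [(a_1, b_1), ..., (a_order, b_order)])."""
    n = len(samples)
    if 2 * order >= n:
        raise ValueError("order must be below half the number of samples")
    a0 = sum(samples) / n
    coeffs = []
    for k in range(1, order + 1):
        # Projection onto cos/sin of harmonic k (orthogonal on a uniform grid).
        a_k = 2.0 / n * sum(s * math.cos(2 * math.pi * k * i / n)
                            for i, s in enumerate(samples))
        b_k = 2.0 / n * sum(s * math.sin(2 * math.pi * k * i / n)
                            for i, s in enumerate(samples))
        coeffs.append((a_k, b_k))
    return a0, coeffs

def evaluate(a0, coeffs, phase):
    """Evaluate the fitted series at phase in [0, 1) (fraction of the period)."""
    value = a0
    for k, (a_k, b_k) in enumerate(coeffs, start=1):
        value += a_k * math.cos(2 * math.pi * k * phase)
        value += b_k * math.sin(2 * math.pi * k * phase)
    return value
```

Truncating the series at a finite order discards frequency content above that order, so frame-to-frame jitter in the estimated angle is suppressed while the slower underlying motion is kept. Evaluating the fitted series at each frame's phase then gives a smoothed angle trajectory that could serve as a training target.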
Authors

Chang Peng
Department of Engineering Mechanics, School of Civil Engineering and Transportation, South China University of Technology, Guangzhou 510640, China
Yifei Zhou
Department of Engineering Mechanics, School of Civil Engineering and Transportation, South China University of Technology, Guangzhou 510640, China
Huifeng Xi
School of Mechanics and Construction Engineering, Jinan University, Guangzhou 510632, China
Shiqing Huang
School of Mechanics and Construction Engineering, Jinan University, Guangzhou 510632, China
Chuangye Chen
Guangdong Provincial Key Laboratory of Speed Capability, School of Physical Education, Jinan University, Guangzhou 510632, China
Jianming Yang
Guangdong Provincial Key Laboratory of Speed Capability, School of Physical Education, Jinan University, Guangzhou 510632, China
Bao Yang
Professor of Cold and Arid Regions Environmental and Engineering Research Institute, Chinese Academy of Sciences, Lanzhou, Gansu
dendroclimatology, tree rings, past global change
Zhenyu Jiang
Research, Amazon
Computer vision, robotics