Analyzing the Synthetic-to-Real Domain Gap in 3D Hand Pose Estimation

📅 2025-03-25

📈 Citations: 0

✨ Influential: 0

career value

231K/year

🤖 AI Summary

This work systematically investigates the sources of domain gap between synthetic and real data in 3D hand pose estimation—a problem lacking principled attribution analysis. We propose the first interpretable domain gap decomposition framework, quantifying four key factors: forearm modeling mismatch, image-frequency statistical discrepancies, hand pose distribution shift, and object occlusion divergence. Methodologically, we integrate a controllable rendering pipeline for synthetic data generation, frequency-domain feature analysis, pose-occlusion decoupled modeling, and cross-domain error attribution evaluation. On standard benchmarks (e.g., FreiHAND, HO3D), models trained solely on synthetic data achieve accuracy comparable to those trained on real data—exhibiting less than 0.5 mm mean joint error difference—thereby substantially narrowing the domain gap. To foster reproducibility and further research, our code and dataset are publicly released.

Technology Category

Application Category

📝 Abstract

Recent synthetic 3D human datasets for the face, body, and hands have pushed the limits on photorealism. Face recognition and body pose estimation have achieved state-of-the-art performance using synthetic training data alone, but for the hand, there is still a large synthetic-to-real gap. This paper presents the first systematic study of the synthetic-to-real gap of 3D hand pose estimation. We analyze the gap and identify key components such as the forearm, image frequency statistics, hand pose, and object occlusions. To facilitate our analysis, we propose a data synthesis pipeline to synthesize high-quality data. We demonstrate that synthetic hand data can achieve the same level of accuracy as real data when integrating our identified components, paving the path to use synthetic data alone for hand pose estimation. Code and data are available at: https://github.com/delaprada/HandSynthesis.git.

Problem

Research questions and friction points this paper is trying to address.

Analyzing synthetic-to-real gap in 3D hand pose estimation

Identifying key components causing the domain gap

Improving synthetic data accuracy to match real data

Innovation

Methods, ideas, or system contributions that make the work stand out.

Systematic study of synthetic-to-real gap

Data synthesis pipeline for high-quality data

Synthetic data matches real data accuracy

🔎 Similar Papers

WiLoR: End-to-end 3D Hand Localization and Reconstruction in-the-wild

2024-09-18arXiv.orgCitations: 8