Multi-person Physics-based Pose Estimation for Combat Sports

📅 2025-04-11
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the ambiguity in multi-person 3D pose estimation during combat sports—characterized by rapid motions, severe occlusions, and tight interpersonal interactions under sparse multi-view settings. We propose a physics-aware joint optimization framework. Methodologically, it integrates Transformer-based multi-view 2D pose tracking, epipolar geometry constraints, long-term video object segmentation, and weighted triangulation. Crucially, we introduce the first multi-body physical trajectory joint optimization mechanism, incorporating kinematic constraints and rigid-body dynamics modeling to ensure spatiotemporal consistency and physical plausibility; spline-based smoothing and physics-informed refinement further enhance robustness. Our approach achieves state-of-the-art performance on a newly established elite boxing benchmark and multiple public datasets. To foster community advancement, we release a high-quality, manually annotated dataset.

Technology Category

Application Category

📝 Abstract
We propose a novel framework for accurate 3D human pose estimation in combat sports using sparse multi-camera setups. Our method integrates robust multi-view 2D pose tracking via a transformer-based top-down approach, employing epipolar geometry constraints and long-term video object segmentation for consistent identity tracking across views. Initial 3D poses are obtained through weighted triangulation and spline smoothing, followed by kinematic optimization to refine pose accuracy. We further enhance pose realism and robustness by introducing a multi-person physics-based trajectory optimization step, effectively addressing challenges such as rapid motions, occlusions, and close interactions. Experimental results on diverse datasets, including a new benchmark of elite boxing footage, demonstrate state-of-the-art performance. Additionally, we release comprehensive annotated video datasets to advance future research in multi-person pose estimation for combat sports.
Problem

Research questions and friction points this paper is trying to address.

Accurate 3D pose estimation in combat sports
Tracking multiple people with rapid motions
Handling occlusions and close interactions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Transformer-based multi-view 2D pose tracking
Weighted triangulation with spline smoothing
Physics-based multi-person trajectory optimization
🔎 Similar Papers
No similar papers found.
H
Hossein Feiz
Ecole de technologie supérieure, Montreal, Canada
D
David Labb'e
Ecole de technologie supérieure, Montreal, Canada
T
Thomas Romeas
Université de Montréal, Montreal, Canada
Jocelyn Faubert
Jocelyn Faubert
Université de Montréal
neurosciencevisionperceptioncognitionpsychophysics
Sheldon Andrews
Sheldon Andrews
Professor, École de technologie supérieure (University of Quebec)
computer graphicsphysics-based animation3D charactersmotion capture