Towards Ball Spin and Trajectory Analysis in Table Tennis Broadcast Videos via Physically Grounded Synthetic-to-Real Transfer

📅 2025-04-28
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the joint inversion of ball spin direction and full 3D trajectory from monocular broadcast videos of table tennis, using only 2D ball-center trajectories—without any real-world annotated data. Method: We propose a novel physics-guided learning framework driven exclusively by synthetic data: high-fidelity synthetic videos are generated via rigid-body dynamics; a unified architecture integrates 2D trajectory encoding, geometry-aware feature enhancement, and physics-constrained inversion to jointly optimize spin classification and 3D trajectory regression in an end-to-end manner. Contribution/Results: Crucially, the method eliminates reliance on real-data fine-tuning, achieving cross-domain generalization through principled physical modeling and augmentation. Experiments demonstrate state-of-the-art performance: 92.0% accuracy in spin-direction classification and a 2D reprojection error of only 0.19% of the image diagonal length.

Technology Category

Application Category

📝 Abstract
Analyzing a player's technique in table tennis requires knowledge of the ball's 3D trajectory and spin. While, the spin is not directly observable in standard broadcasting videos, we show that it can be inferred from the ball's trajectory in the video. We present a novel method to infer the initial spin and 3D trajectory from the corresponding 2D trajectory in a video. Without ground truth labels for broadcast videos, we train a neural network solely on synthetic data. Due to the choice of our input data representation, physically correct synthetic training data, and using targeted augmentations, the network naturally generalizes to real data. Notably, these simple techniques are sufficient to achieve generalization. No real data at all is required for training. To the best of our knowledge, we are the first to present a method for spin and trajectory prediction in simple monocular broadcast videos, achieving an accuracy of 92.0% in spin classification and a 2D reprojection error of 0.19% of the image diagonal.
Problem

Research questions and friction points this paper is trying to address.

Infer 3D ball trajectory and spin from 2D videos
Train neural network using only synthetic data
Achieve accurate spin and trajectory prediction in monocular videos
Innovation

Methods, ideas, or system contributions that make the work stand out.

Infer spin from 2D trajectory in videos
Train neural network solely on synthetic data
Achieve generalization without real training data
🔎 Similar Papers
No similar papers found.
Daniel Kienzle
Daniel Kienzle
University of Augsburg
Computer Vision
R
Robin Schon
University of Augsburg, Germany
Rainer Lienhart
Rainer Lienhart
Professor of Computer Science, University of Augsburg
Machine LearningComputer VisionMultimedia Computing
S
Shin'Ichi Satoh
National Institute of Informatics, Japan; University of Tokyo, Japan