TOTNet: Occlusion-Aware Temporal Tracking for Robust Ball Detection in Sports Videos

πŸ“… 2025-08-13
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Insufficient robustness in ball tracking under occlusion in sports videos severely hampers event detection and referee assistance. To address this, we propose TOMNetβ€”a ball detection and tracking framework specifically designed for highly occluded scenarios. TOMNet innovatively integrates 3D convolutional networks to model spatiotemporal dynamics, a visibility-weighted loss function that explicitly encodes occlusion states, and a targeted occlusion-aware data augmentation strategy. Furthermore, we introduce TTA, the first publicly available table-tennis benchmark dataset featuring high occlusion rates. Extensive experiments across four public datasets demonstrate that TOMNet significantly improves tracking accuracy: RMSE decreases from 37.30 to 7.19 pixels, and localization accuracy on fully occluded frames rises from 0.63 to 0.80. TOMNet consistently outperforms state-of-the-art methods across all evaluated metrics.

Technology Category

Application Category

πŸ“ Abstract
Robust ball tracking under occlusion remains a key challenge in sports video analysis, affecting tasks like event detection and officiating. We present TOTNet, a Temporal Occlusion Tracking Network that leverages 3D convolutions, visibility-weighted loss, and occlusion augmentation to improve performance under partial and full occlusions. Developed in collaboration with Paralympics Australia, TOTNet is designed for real-world sports analytics. We introduce TTA, a new occlusion-rich table tennis dataset collected from professional-level Paralympic matches, comprising 9,159 samples with 1,996 occlusion cases. Evaluated on four datasets across tennis, badminton, and table tennis, TOTNet significantly outperforms prior state-of-the-art methods, reducing RMSE from 37.30 to 7.19 and improving accuracy on fully occluded frames from 0.63 to 0.80. These results demonstrate TOTNets effectiveness for offline sports analytics in fast-paced scenarios. Code and data access:href{https://github.com/AugustRushG/TOTNet}{AugustRushG/TOTNet}.
Problem

Research questions and friction points this paper is trying to address.

Robust ball tracking under occlusion in sports videos
Improving detection accuracy during partial and full occlusions
Addressing challenges in fast-paced sports analytics scenarios
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses 3D convolutions for temporal tracking
Employs visibility-weighted loss for occlusion handling
Applies occlusion augmentation to improve robustness
πŸ”Ž Similar Papers
No similar papers found.
H
Hao Xu
Deakin University, Melbourne, Australia
A
Arbind Agrahari Baniya
Deakin University, Melbourne, Australia
S
Sam Wells
Paralympics Australia, Melbourne, Australia
Mohamed Reda Bouadjenek
Mohamed Reda Bouadjenek
Deakin University
Artificial IntelligenceMachine LearningInformation Retrieval
R
Richard Dazely
Deakin University, Melbourne, Australia
Sunil Aryal
Sunil Aryal
Deakin University Australia
Data miningMachine learning