GTATrack: Winner Solution to SoccerTrack 2025 with Deep-EIoU and Global Tracklet Association

📅 2025-10-27
🏛️ Proceedings of the 8th International ACM Workshop on Multimedia Content Analysis in Sports
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenging problem of multi-object tracking in soccer scenes captured by static fisheye cameras, where irregular motion, appearance similarity, frequent occlusions, and geometric distortion severely degrade tracking performance. To tackle these issues, the authors propose GTATrack, a hierarchical tracking framework that integrates Deep-EIoU for motion-agnostic online association and introduces Global Trajectory Association (GTA) for trajectory-level refinement. A pseudo-labeling strategy is further employed to enhance the recall of small-object detection. By synergistically combining local matching with global reasoning, the method effectively mitigates identity switches, occlusion-induced failures, and trajectory fragmentation. Evaluated on the SoccerTrack 2025 Challenge, GTATrack achieves state-of-the-art performance with an HOTA score of 0.60 and only 982 false positives, significantly outperforming existing approaches in fisheye-based soccer tracking scenarios.

Technology Category

Application Category

📝 Abstract
Multi-object tracking (MOT) in sports is highly challenging due to irregular player motion, uniform appearances, and frequent occlusions. These difficulties are further exacerbated by the geometric distortion and extreme scale variation introduced by static fisheye cameras. In this work, we present GTATrack, a hierarchical tracking framework that win first place in the SoccerTrack Challenge 2025. GTATrack integrates two core components: Deep Expansion IoU (Deep-EIoU) for motion-agnostic online association and Global Tracklet Association (GTA) for trajectory-level refinement. This two-stage design enables both robust short-term matching and long-term identity consistency. Additionally, a pseudo-labeling strategy is used to boost detector recall on small and distorted targets. The synergy between local association and global reasoning effectively addresses identity switches, occlusions, and tracking fragmentation. Our method achieved a winning HOTA score of 0.60 and significantly reduced false positives to 982, demonstrating state-of-the-art accuracy in fisheye-based soccer tracking.
Problem

Research questions and friction points this paper is trying to address.

multi-object tracking
fisheye camera
occlusions
identity switches
scale variation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Deep-EIoU
Global Tracklet Association
multi-object tracking
fisheye camera
pseudo-labeling
🔎 Similar Papers
No similar papers found.
R
Rong-Lin Jian
Institute of Intelligent Systems, College of AI, National Yang Ming Chiao Tung University, Tainan, Taiwan
M
Ming-Chi Luo
Department of Computer Science, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
C
Chen-Wei Huang
Master Program in Remote Sensing Science and Technology, National Central University, Taoyuan, Taiwan
Chia-Ming Lee
Chia-Ming Lee
National Yang Ming Chiao Tung University
Computer VisionImage ProcessingInformation ForensicsMultimedia
Yu-Fan Lin
Yu-Fan Lin
Institute of Data Science, National Cheng Kung University, Taiwan
computer visiondeep learningmultimodal learning
Chih-Chung Hsu
Chih-Chung Hsu
Associate Professor of Institute of Intelligent Systems, College of AI, NYCU
Deep learningImage processingcomputer visionimage compressionvideo editing