🤖 AI Summary
This work addresses the scarcity of annotated group activity data in team sports video retrieval by proposing a self-supervised feature learning framework that does not require predefined category labels. The approach integrates human-in-the-loop self-supervised pretraining with a data-efficient interactive fine-tuning mechanism, dynamically refining the feature space through user feedback and supported by an efficient video sampling strategy. Experiments on two team sports datasets demonstrate that the proposed method significantly improves retrieval performance in unsupervised settings, while ablation studies confirm the effectiveness of each component.