Beyond the Individual: Introducing Group Intention Forecasting with SHOT Dataset

📅 2025-09-24
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Traditional intent recognition focuses on individual agents, making it difficult to model collective goal formation in group collaboration. This paper formally introduces the novel task of *group intent prediction*, aiming to infer dynamically evolving group-level objectives from individual actions and interactions. To support this task, we present SHOT—the first large-scale, multi-view basketball video dataset—comprising 1,979 game clips annotated with six fine-grained individual attributes, capturing multi-agent, multi-view, and hierarchical intent characteristics. We further propose GIFT, a unified framework that jointly models fine-grained individual features and dynamic interaction graphs across multiple camera views to capture the spatiotemporal evolution of group intent. Extensive experiments demonstrate that SHOT significantly advances group intent modeling capability, and GIFT consistently outperforms all baselines across standard metrics. This work establishes foundational resources and methodology for understanding collaborative group behavior in complex, real-world settings.

Technology Category

Application Category

📝 Abstract
Intention recognition has traditionally focused on individual intentions, overlooking the complexities of collective intentions in group settings. To address this limitation, we introduce the concept of group intention, which represents shared goals emerging through the actions of multiple individuals, and Group Intention Forecasting (GIF), a novel task that forecasts when group intentions will occur by analyzing individual actions and interactions before the collective goal becomes apparent. To investigate GIF in a specific scenario, we propose SHOT, the first large-scale dataset for GIF, consisting of 1,979 basketball video clips captured from 5 camera views and annotated with 6 types of individual attributes. SHOT is designed with 3 key characteristics: multi-individual information, multi-view adaptability, and multi-level intention, making it well-suited for studying emerging group intentions. Furthermore, we introduce GIFT (Group Intention ForecasTer), a framework that extracts fine-grained individual features and models evolving group dynamics to forecast intention emergence. Experimental results confirm the effectiveness of SHOT and GIFT, establishing a strong foundation for future research in group intention forecasting. The dataset is available at https://xinyi-hu.github.io/SHOT_DATASET.
Problem

Research questions and friction points this paper is trying to address.

Forecasting when shared group intentions emerge from individual actions
Addressing limitations of individual intention recognition in collective settings
Analyzing multi-individual interactions before collective goals become apparent
Innovation

Methods, ideas, or system contributions that make the work stand out.

Introduces group intention forecasting for collective goals
Proposes SHOT dataset with multi-view basketball videos
Develops GIFT framework modeling individual and group dynamics
🔎 Similar Papers
No similar papers found.
Ruixu Zhang
Ruixu Zhang
Tsinghua University
Y
Yuran Wang
School of Mathematical Sciences, Peking University, Beijing, China; National Engineering Research Center for Multimedia Software, School of Computer Science, Wuhan University, Wuhan, China
X
Xinyi Hu
National Engineering Research Center for Multimedia Software, Institute of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan, China
C
Chaoyu Mai
National Engineering Research Center for Multimedia Software, Institute of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan, China
W
Wenxuan Liu
School of Computer Science, Peking University; State Key Laboratory for Multimedia Information Processing, Peking University, Beijing, China
Danni Xu
Danni Xu
NUS
misinformation LMMs
X
Xian Zhong
Hubei Key Laboratory of Transportation Internet of Things, School of Computer Science and Artificial Intelligence, Wuhan University of Technology, Wuhan, China
Z
Zheng Wang
National Engineering Research Center for Multimedia Software, Institute of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan, China