🤖 AI Summary
To address low trajectory prediction accuracy caused by limited sensor field-of-view and occlusions in single-vehicle perception, this paper proposes V2INet—the first end-to-end multi-view trajectory prediction framework designed for V2X cooperative driving. Methodologically, it fuses heterogeneous V2V and V2I observations, employs a Transformer for cross-view feature alignment, leverages graph neural networks to model inter-vehicle and infrastructure-vehicle interactions, and introduces a novel posterior conformal prediction module to generate statistically valid confidence regions—marking the first application of conformal prediction in trajectory forecasting. Evaluated on the V2X-Seq dataset, V2INet achieves a 19.3% reduction in Final Displacement Error (FDE) and a 26.7% decrease in Miss Rate (MR), while maintaining real-time inference on a single GPU. The source code is publicly available.
📝 Abstract
Current research on trajectory prediction primarily relies on data collected by onboard sensors of an ego vehicle. With the rapid advancement in connected technologies, such as vehicle-to-vehicle (V2V) and vehicle-to-infrastructure (V2I) communication, valuable information from alternate views becomes accessible via wireless networks. The integration of information from alternative views has the potential to overcome the inherent limitations associated with a single viewpoint, such as occlusions and limited field of view. In this work, we introduce V2INet, a novel trajectory prediction framework designed to model multi-view data by extending existing single-view models. Unlike previous approaches where the multi-view data is manually fused or formulated as a separate training stage, our model supports end-to-end training, enhancing both flexibility and performance. Moreover, the predicted multimodal trajectories are calibrated by a post-hoc conformal prediction module to get valid and efficient confidence regions. We evaluated the entire framework using the real-world V2I dataset V2X-Seq. Our results demonstrate superior performance in terms of Final Displacement Error (FDE) and Miss Rate (MR) using a single GPU. The code is publicly available at: url{https://github.com/xichennn/V2I_trajectory_prediction}.