🤖 AI Summary
Existing V2X autonomous driving systems primarily enhance isolated functionalities, lacking end-to-end co-optimization across the full perception–mapping–prediction–planning pipeline; moreover, they suffer from inefficient data transmission and fusion under bandwidth constraints. This paper proposes UniV2X, a unified framework featuring a novel sparse-dense hybrid transmission and fusion mechanism, multi-view feature alignment, joint encoding of semantic point clouds and BEV features, a lightweight communication-adaptation module, and a differentiable planning head—enabling holistic end-to-end optimization of the entire V2X stack. Evaluated on DAIR-V2X, UniV2X achieves a 12.7% improvement in planning success rate, with significant gains across all intermediate tasks (e.g., detection, tracking, mapping), while maintaining real-time inference latency. The code is publicly available.
📝 Abstract
Cooperatively utilizing both ego-vehicle and infrastructure sensor data via V2X communication has emerged as a promising approach for advanced autonomous driving. However, current research mainly focuses on improving individual modules, rather than taking end-to-end learning to optimize final planning performance, resulting in underutilized data potential. In this paper, we introduce UniV2X, a pioneering cooperative autonomous driving framework that seamlessly integrates all key driving modules across diverse views into a unified network. We propose a sparse-dense hybrid data transmission and fusion mechanism for effective vehicle-infrastructure cooperation, offering three advantages: 1) Effective for simultaneously enhancing agent perception, online mapping, and occupancy prediction, ultimately improving planning performance. 2) Transmission-friendly for practical and limited communication conditions. 3) Reliable data fusion with interpretability of this hybrid data. We implement UniV2X, as well as reproducing several benchmark methods, on the challenging DAIR-V2X, the real-world cooperative driving dataset. Experimental results demonstrate the effectiveness of UniV2X in significantly enhancing planning performance, as well as all intermediate output performance. The project is available at href{https://github.com/AIR-THU/UniV2X}{https://github.com/AIR-THU/UniV2X}.