CRUISE: Cooperative Reconstruction and Editing in V2X Scenarios using Gaussian Splatting

📅 2025-07-24

📈 Citations: 0

✨ Influential: 0

career value

220K/year

🤖 AI Summary

To address the challenges of low-fidelity environment reconstruction/editing and unrealistic, uncontrollable data augmentation in V2X scenarios, this paper proposes the first high-fidelity, editable 3D reconstruction framework tailored for vehicle-infrastructure cooperation. Methodologically, it introduces (1) a decomposition-based Gaussian point lattice representation enabling structured, editable 3D modeling of dynamic traffic agents; (2) the first integration of differentiable rendering into V2X data augmentation, supporting multi-view joint synthesis and consistency optimization across vehicle-mounted and roadside sensors; and (3) a V2X collaborative simulation augmentation pipeline unifying real-world reconstruction with controllable virtual editing. Evaluated on the V2X-Seq benchmark, our method significantly improves 3D detection and tracking performance across vehicle, infrastructure, and cooperative views, while efficiently generating diverse, high-difficulty edge cases—establishing a new paradigm for autonomous driving model training and robustness evaluation.

Technology Category

Application Category

📝 Abstract

Vehicle-to-everything (V2X) communication plays a crucial role in autonomous driving, enabling cooperation between vehicles and infrastructure. While simulation has significantly contributed to various autonomous driving tasks, its potential for data generation and augmentation in V2X scenarios remains underexplored. In this paper, we introduce CRUISE, a comprehensive reconstruction-and-synthesis framework designed for V2X driving environments. CRUISE employs decomposed Gaussian Splatting to accurately reconstruct real-world scenes while supporting flexible editing. By decomposing dynamic traffic participants into editable Gaussian representations, CRUISE allows for seamless modification and augmentation of driving scenes. Furthermore, the framework renders images from both ego-vehicle and infrastructure views, enabling large-scale V2X dataset augmentation for training and evaluation. Our experimental results demonstrate that: 1) CRUISE reconstructs real-world V2X driving scenes with high fidelity; 2) using CRUISE improves 3D detection across ego-vehicle, infrastructure, and cooperative views, as well as cooperative 3D tracking on the V2X-Seq benchmark; and 3) CRUISE effectively generates challenging corner cases.

Problem

Research questions and friction points this paper is trying to address.

Reconstructs V2X scenes using Gaussian Splatting for accuracy

Enables flexible editing of dynamic traffic participants

Augments V2X datasets for training and evaluation

Innovation

Methods, ideas, or system contributions that make the work stand out.

Decomposed Gaussian Splatting for scene reconstruction

Editable Gaussian representations for dynamic objects

Multi-view rendering for V2X dataset augmentation

🔎 Similar Papers

V2X Cooperative Perception for Autonomous Driving: Recent Advances and Challenges