OmniRe: Omni Urban Scene Reconstruction

📅 2024-08-29
🏛️ arXiv.org
📈 Citations: 30
Influential: 7
📄 PDF
🤖 AI Summary
High-fidelity digital twin construction for diverse dynamic foreground objects—vehicles, pedestrians, and cyclists—in urban scenes remains challenging due to their heterogeneous motion patterns and appearance characteristics. Method: We propose the first unified 3D Gaussian Splatting (3DGS)-based scene graph framework supporting *all* dynamic foreground categories. Our approach establishes decoupled, object-specific Gaussian representations in canonical space, jointly optimized via multi-canonical-space modeling and end-to-end training, enabling real-time 60 Hz co-simulation. Contribution/Results: Unlike prior methods limited to vehicles, ours generalizes across dynamic agent types, significantly improving reconstruction completeness and physical consistency. It outperforms state-of-the-art on Waymo and demonstrates strong generalization and robustness across five major autonomous driving datasets—including complex urban scenarios. This advances human-vehicle interaction analysis and realistic human behavior simulation.

Technology Category

Application Category

📝 Abstract
We introduce OmniRe, a comprehensive system for efficiently creating high-fidelity digital twins of dynamic real-world scenes from on-device logs. Recent methods using neural fields or Gaussian Splatting primarily focus on vehicles, hindering a holistic framework for all dynamic foregrounds demanded by downstream applications, e.g., the simulation of human behavior. OmniRe extends beyond vehicle modeling to enable accurate, full-length reconstruction of diverse dynamic objects in urban scenes. Our approach builds scene graphs on 3DGS and constructs multiple Gaussian representations in canonical spaces that model various dynamic actors, including vehicles, pedestrians, cyclists, and others. OmniRe allows holistically reconstructing any dynamic object in the scene, enabling advanced simulations (~60Hz) that include human-participated scenarios, such as pedestrian behavior simulation and human-vehicle interaction. This comprehensive simulation capability is unmatched by existing methods. Extensive evaluations on the Waymo dataset show that our approach outperforms prior state-of-the-art methods quantitatively and qualitatively by a large margin. We further extend our results to 5 additional popular driving datasets to demonstrate its generalizability on common urban scenes.
Problem

Research questions and friction points this paper is trying to address.

Reconstruct diverse dynamic objects in urban scenes
Enable advanced simulations with human-participated scenarios
Outperform prior methods on fidelity and generalizability
Innovation

Methods, ideas, or system contributions that make the work stand out.

Scene graphs on 3DGS for dynamic objects
Multiple Gaussian representations in canonical spaces
Holistic reconstruction enabling human-participated simulations
🔎 Similar Papers
No similar papers found.
Ziyu Chen
Ziyu Chen
Chonqing University
DCOPsMAS
J
Jiawei Yang
University of Southern California
Jiahui Huang
Jiahui Huang
NVIDIA
3D Computer VisionGraphics
R
Riccardo de Lutio
NVIDIA Research
J
Janick Martinez Esturo
NVIDIA Research
B
B. Ivanovic
NVIDIA Research
Or Litany
Or Litany
Technion, NVIDIA
Computer VisionArtificial IntelligenceDeep LearningAI
Z
Z. Gojcic
NVIDIA Research
S
S. Fidler
University of Toronto
Marco Pavone
Marco Pavone
Stanford University and NVIDIA
RoboticsControl TheoryDistributed ControlIntelligent Transportation systems
L
Li-Na Song
Shanghai Jiao Tong University
Y
Yue Wang
NVIDIA Research, University of Southern California