LVD-GS: Gaussian Splatting SLAM for Dynamic Scenes via Hierarchical Explicit-Implicit Representation Collaboration Rendering

📅 2025-10-26
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address cumulative pose drift and scale ambiguity in 3D Gaussian Splatting SLAM for large-scale dynamic outdoor scenes, this paper proposes a LiDAR-visual fusion hierarchical collaborative Gaussian SLAM system. The method introduces two key innovations: (1) an explicit–implicit hierarchical representation mechanism that enables human-like chain-of-thought multimodal collaboration for mutual enhancement; and (2) a joint dynamic modeling module integrating open-world semantic segmentation with DINO-Depth–driven uncertainty-aware implicit residual constraints to generate fine-grained dynamic masks. Evaluated on KITTI, nuScenes, and a custom dataset, the approach achieves state-of-the-art performance—significantly suppressing scale drift while improving dynamic object removal accuracy and reconstruction robustness.

Technology Category

Application Category

📝 Abstract
3D Gaussian Splatting SLAM has emerged as a widely used technique for high-fidelity mapping in spatial intelligence. However, existing methods often rely on a single representation scheme, which limits their performance in large-scale dynamic outdoor scenes and leads to cumulative pose errors and scale ambiguity. To address these challenges, we propose extbf{LVD-GS}, a novel LiDAR-Visual 3D Gaussian Splatting SLAM system. Motivated by the human chain-of-thought process for information seeking, we introduce a hierarchical collaborative representation module that facilitates mutual reinforcement for mapping optimization, effectively mitigating scale drift and enhancing reconstruction robustness. Furthermore, to effectively eliminate the influence of dynamic objects, we propose a joint dynamic modeling module that generates fine-grained dynamic masks by fusing open-world segmentation with implicit residual constraints, guided by uncertainty estimates from DINO-Depth features. Extensive evaluations on KITTI, nuScenes, and self-collected datasets demonstrate that our approach achieves state-of-the-art performance compared to existing methods.
Problem

Research questions and friction points this paper is trying to address.

Addressing scale drift in dynamic SLAM systems
Eliminating dynamic object interference in reconstruction
Enhancing mapping robustness with collaborative representations
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hierarchical explicit-implicit representation collaboration for mapping
Joint dynamic modeling module with fine-grained masks
Uncertainty-guided fusion of segmentation and depth features
🔎 Similar Papers
W
Wenkai Zhu
School of Instrument Science and Engineering, Southeast University, Nanjing, China
X
Xu Li
School of Instrument Science and Engineering, Southeast University, Nanjing, China
Q
Qimin Xu
School of Instrument Science and Engineering, Southeast University, Nanjing, China
B
Benwu Wang
School of Instrument Science and Engineering, Southeast University, Nanjing, China
Kun Wei
Kun Wei
School of Computer Science, Northwestern Polytechnical University
deep learningcompute sciencespeech
Y
Yiming Peng
School of Instrument Science and Engineering, Southeast University, Nanjing, China
Z
Zihang Wang
School of Instrument Science and Engineering, Southeast University, Nanjing, China