SemGauss-SLAM: Dense Semantic Gaussian Splatting SLAM

๐Ÿ“… 2024-03-12
๐Ÿ›๏ธ arXiv.org
๐Ÿ“ˆ Citations: 31
โœจ Influential: 6
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This work addresses key challenges in dense semantic SLAMโ€”low 3D semantic mapping accuracy, camera tracking drift, and poor novel-view rendering qualityโ€”by proposing the first end-to-end framework integrating semantic feature embedding into 3D Gaussian representations. Methodologically: (1) we introduce a semantic-enhanced 3D Gaussian representation, encoding pixel-level semantic features into Gaussian ellipsoid attributes; (2) we design a feature-level supervised loss to jointly optimize geometric and semantic consistency; and (3) we propose semantic-correlation-driven multi-frame bundle adjustment to significantly suppress pose accumulation errors. Evaluated on Replica and ScanNet, our method outperforms state-of-the-art NeRF-based SLAM approaches across mapping accuracy, tracking robustness, and novel-view synthesis quality, while enabling high-fidelity dense semantic segmentation and editable semantic 3D map reconstruction.

Technology Category

Application Category

๐Ÿ“ Abstract
We propose SemGauss-SLAM, a dense semantic SLAM system utilizing 3D Gaussian representation, that enables accurate 3D semantic mapping, robust camera tracking, and high-quality rendering simultaneously. In this system, we incorporate semantic feature embedding into 3D Gaussian representation, which effectively encodes semantic information within the spatial layout of the environment for precise semantic scene representation. Furthermore, we propose feature-level loss for updating 3D Gaussian representation, enabling higher-level guidance for 3D Gaussian optimization. In addition, to reduce cumulative drift in tracking and improve semantic reconstruction accuracy, we introduce semantic-informed bundle adjustment leveraging multi-frame semantic associations for joint optimization of 3D Gaussian representation and camera poses, leading to low-drift tracking and accurate mapping. Our SemGauss-SLAM method demonstrates superior performance over existing radiance field-based SLAM methods in terms of mapping and tracking accuracy on Replica and ScanNet datasets, while also showing excellent capabilities in high-precision semantic segmentation and dense semantic mapping.
Problem

Research questions and friction points this paper is trying to address.

Enables accurate 3D semantic mapping and camera tracking
Incorporates semantic features into 3D Gaussian representation
Reduces tracking drift with semantic-informed bundle adjustment
Innovation

Methods, ideas, or system contributions that make the work stand out.

3D Gaussian representation with semantic embedding
Feature-level loss for Gaussian optimization
Semantic-informed bundle adjustment for tracking
๐Ÿ”Ž Similar Papers
No similar papers found.