UDGS-SLAM : UniDepth Assisted Gaussian Splatting for Monocular SLAM

📅 2024-08-31
🏛️ Array
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the inherent trade-off between geometric accuracy and rendering quality in monocular SLAM. We propose a joint optimization framework that tightly integrates monocular depth estimation with 3D Gaussian Splatting. Our key contributions are: (1) the first incorporation of the lightweight UniDepth network as a geometric depth prior into Gaussian optimization—without end-to-end coupling; (2) depth-guided adaptive Gaussian initialization and gradient masking to align geometry and appearance; (3) depth-consistency regularization to enforce geometric fidelity; and (4) a keyframe-driven dynamic Gaussian insertion and pruning mechanism for efficient scene representation. Evaluated on TUM and ScanNet, our method achieves state-of-the-art pose accuracy (ATE RMSE < 0.012 m) and real-time rendering at 45 FPS—substantially outperforming MonoGS and iMAP in both localization precision and visual quality.

Technology Category

Application Category

Problem

Research questions and friction points this paper is trying to address.

Integrates UniDepth with Gaussian splatting for monocular SLAM
Eliminates need for RGB-D sensors in depth estimation
Optimizes camera trajectory and scene representation jointly
Innovation

Methods, ideas, or system contributions that make the work stand out.

UniDepth integrated with Gaussian splatting
Statistical filtering for depth consistency
Joint optimization of trajectory and scene
M
Mostafa Mansour
Faculty of Engineering and Natural Sciences, Tampere University, Finland
A
Ahmed Abdelsalam
School of Engineering Science, LUT University, Finland
A
A. Happonen
School of Engineering Science, LUT University, Finland
J
J. Porras
School of Electrical Engineering, Aalto University, Finland
Esa Rahtu
Esa Rahtu
Professor, Tampere University, Finland
Computer VisionImage UnderstandingMachine Learning