See In Detail: Enhancing Sparse-view 3D Gaussian Splatting with Local Depth and Semantic Regularization

πŸ“… 2025-01-20
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
To address geometric distortion, detail loss, and inter-view inconsistency in 3D Gaussian Splatting (3DGS) reconstruction under sparse-view settings, this paper proposes a novel method integrating local depth constraints with multi-view semantic consistency regularization. We innovatively incorporate self-supervised visual features extracted by DINO-ViT into the 3DGS optimization framework to formulate a semantic consistency lossβ€”marking the first such integration in 3DGS. Additionally, we introduce a local depth smoothness regularization term to mitigate ill-posedness arising from sparse input views. Evaluated on the LLFF dataset, our method achieves a 0.4 dB PSNR improvement over baseline methods. It significantly suppresses geometric distortions, enhances structural accuracy and texture fidelity in novel-view synthesis, and improves the robustness and generalization capability of 3D reconstruction under sparse-view conditions.

Technology Category

Application Category

πŸ“ Abstract
3D Gaussian Splatting (3DGS) has shown remarkable performance in novel view synthesis. However, its rendering quality deteriorates with sparse inphut views, leading to distorted content and reduced details. This limitation hinders its practical application. To address this issue, we propose a sparse-view 3DGS method. Given the inherently ill-posed nature of sparse-view rendering, incorporating prior information is crucial. We propose a semantic regularization technique, using features extracted from the pretrained DINO-ViT model, to ensure multi-view semantic consistency. Additionally, we propose local depth regularization, which constrains depth values to improve generalization on unseen views. Our method outperforms state-of-the-art novel view synthesis approaches, achieving up to 0.4dB improvement in terms of PSNR on the LLFF dataset, with reduced distortion and enhanced visual quality.
Problem

Research questions and friction points this paper is trying to address.

3D image technology
view synthesis quality
limited original views
Innovation

Methods, ideas, or system contributions that make the work stand out.

3D Gaussian Splattering
Semantic Regularization
Local Depth Regularization
πŸ”Ž Similar Papers
No similar papers found.
Zongqi He
Zongqi He
Student, The Hong Kong Polytechnic University
Computer version3D reconstructionLow-level vision
Zhe Xiao
Zhe Xiao
The Hong Kong Polytechnic University
Computer VisionImage ProcessingNeural Rendering
K
Kin-Chung Chan
Department of Electrical and Electronic Engineering, The Hong Kong Polytechnic University
Yushen Zuo
Yushen Zuo
The Hong Kong Polytechnic University
Computer visionDeep learningImage Generation
J
Jun Xiao
Department of Electrical and Electronic Engineering, The Hong Kong Polytechnic University
K
Kin-Man Lam
Department of Electrical and Electronic Engineering, The Hong Kong Polytechnic University