SPC-GS: Gaussian Splatting with Semantic-Prompt Consistency for Indoor Open-World Free-view Synthesis from Sparse Inputs

📅 2025-03-16

📈 Citations: 0

✨ Influential: 0

career value

213K/year

🤖 AI Summary

To address geometric incompleteness and insufficient supervision in 3D Gaussian Splatting (3DGS) for free-viewpoint synthesis in indoor open-world scenes under sparse input conditions, this paper proposes Semantic-Prompt Consistent Gaussian Splatting. The method introduces two key innovations: (1) Scene-layout-guided Gaussian Initialization (SGI), which leverages prior geometric structure to improve the plausibility of initial Gaussian distributions; and (2) SAM2-based cross-view Semantic Prompt Consistency regularization (SPC), jointly enforcing 2D semantic segmentation and 3D rendering constraints to enable semantic-aware reconstruction from sparse views. To our knowledge, this is the first approach to achieve high-fidelity, semantic-guided reconstruction in open-world settings with sparse inputs. Experiments on Replica and ScanNet demonstrate a 3.06 dB PSNR improvement and a 7.3% gain in open-world semantic segmentation mIoU over state-of-the-art sparse-input methods.

Technology Category

Application Category

📝 Abstract

3D Gaussian Splatting-based indoor open-world free-view synthesis approaches have shown significant performance with dense input images. However, they exhibit poor performance when confronted with sparse inputs, primarily due to the sparse distribution of Gaussian points and insufficient view supervision. To relieve these challenges, we propose SPC-GS, leveraging Scene-layout-based Gaussian Initialization (SGI) and Semantic-Prompt Consistency (SPC) Regularization for open-world free view synthesis with sparse inputs. Specifically, SGI provides a dense, scene-layout-based Gaussian distribution by utilizing view-changed images generated from the video generation model and view-constraint Gaussian points densification. Additionally, SPC mitigates limited view supervision by employing semantic-prompt-based consistency constraints developed by SAM2. This approach leverages available semantics from training views, serving as instructive prompts, to optimize visually overlapping regions in novel views with 2D and 3D consistency constraints. Extensive experiments demonstrate the superior performance of SPC-GS across Replica and ScanNet benchmarks. Notably, our SPC-GS achieves a 3.06 dB gain in PSNR for reconstruction quality and a 7.3% improvement in mIoU for open-world semantic segmentation.

Problem

Research questions and friction points this paper is trying to address.

Improves sparse input performance in 3D Gaussian Splatting.

Enhances view supervision with semantic-prompt consistency.

Achieves better reconstruction and semantic segmentation quality.

Innovation

Methods, ideas, or system contributions that make the work stand out.

Scene-layout-based Gaussian Initialization for dense distribution

Semantic-Prompt Consistency for view supervision enhancement

2D and 3D consistency constraints for overlapping regions

🔎 Similar Papers

No similar papers found.

Netflix

The overall market range for Netflix Internships is typically $40/hour - $110/hour.

Los Gatos, CA, USA / Los Angeles, CA, USA

PhD – Generative Models for Closed-loop Synthesis

Bosch Group

Renningen, BW, DE

Authors to Follow