OracleGS: Grounding Generative Priors for Sparse-View Gaussian Splatting

📅 2025-09-27

📈 Citations: 0

✨ Influential: 0

career value

213K/year

🤖 AI Summary

Sparse-view novel view synthesis is fundamentally ill-posed due to geometric ambiguity: regression-based methods preserve geometry faithfully but lack completeness, whereas generative methods enable scene completion yet often introduce structural inconsistencies. To address this, we propose a “generative–verificatory” framework that, for the first time, synergistically integrates a pre-trained 3D-aware diffusion model with multi-view stereo (MVS) attention—leveraging the diffusion model to supply semantically coherent scene priors, and employing MVS attention maps as a geometric oracle to quantify 3D uncertainty and guide Gaussian splatting optimization. We further design an uncertainty-weighted loss that adaptively fuses generative priors with geometric evidence, effectively suppressing hallucinated artifacts while ensuring geometrically plausible completion. On the Mip-NeRF 360 and NeRF Synthetic benchmarks, our method achieves state-of-the-art performance in both reconstruction completeness and geometric accuracy.

Technology Category

Application Category

📝 Abstract

Sparse-view novel view synthesis is fundamentally ill-posed due to severe geometric ambiguity. Current methods are caught in a trade-off: regressive models are geometrically faithful but incomplete, whereas generative models can complete scenes but often introduce structural inconsistencies. We propose OracleGS, a novel framework that reconciles generative completeness with regressive fidelity for sparse view Gaussian Splatting. Instead of using generative models to patch incomplete reconstructions, our "propose-and-validate" framework first leverages a pre-trained 3D-aware diffusion model to synthesize novel views to propose a complete scene. We then repurpose a multi-view stereo (MVS) model as a 3D-aware oracle to validate the 3D uncertainties of generated views, using its attention maps to reveal regions where the generated views are well-supported by multi-view evidence versus where they fall into regions of high uncertainty due to occlusion, lack of texture, or direct inconsistency. This uncertainty signal directly guides the optimization of a 3D Gaussian Splatting model via an uncertainty-weighted loss. Our approach conditions the powerful generative prior on multi-view geometric evidence, filtering hallucinatory artifacts while preserving plausible completions in under-constrained regions, outperforming state-of-the-art methods on datasets including Mip-NeRF 360 and NeRF Synthetic.

Problem

Research questions and friction points this paper is trying to address.

Addresses geometric ambiguity in sparse-view novel view synthesis

Reconciles generative completeness with regressive reconstruction fidelity

Filters hallucinatory artifacts while preserving plausible scene completions

Innovation

Methods, ideas, or system contributions that make the work stand out.

Leverages diffusion model to synthesize novel views

Uses MVS model as oracle to validate 3D uncertainties

Guides Gaussian Splatting via uncertainty-weighted loss

🔎 Similar Papers

LM-Gaussian: Boost Sparse-view 3D Gaussian Splatting with Large Model Priors