GauSS-MI: Gaussian Splatting Shannon Mutual Information for Active 3D Reconstruction

📅 2025-04-29
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the limitation of existing next-best-view (NBV) selection methods in real-time active 3D reconstruction, which prioritize geometric completeness while neglecting visual rendering uncertainty. We propose GauSS-MI—the first pixel-level visual-quality-oriented uncertainty metric—by embedding Shannon mutual information into the 3D Gaussian Splatting framework, enabling differentiable, real-time, and semantics-aware NBV planning. Our method jointly optimizes viewpoint selection and camera motion through probabilistic modeling and active perception. Evaluated across diverse synthetic and real-world scenes, GauSS-MI achieves significant improvements in visual reconstruction quality (↑ PSNR/SSIM), reduces required sampling by 37%, and maintains >20 FPS real-time performance—thereby overcoming the fundamental constraints of conventional geometry-driven paradigms.

Technology Category

Application Category

📝 Abstract
This research tackles the challenge of real-time active view selection and uncertainty quantification on visual quality for active 3D reconstruction. Visual quality is a critical aspect of 3D reconstruction. Recent advancements such as Neural Radiance Fields (NeRF) and 3D Gaussian Splatting (3DGS) have notably enhanced the image rendering quality of reconstruction models. Nonetheless, the efficient and effective acquisition of input images for reconstruction-specifically, the selection of the most informative viewpoint-remains an open challenge, which is crucial for active reconstruction. Existing studies have primarily focused on evaluating geometric completeness and exploring unobserved or unknown regions, without direct evaluation of the visual uncertainty within the reconstruction model. To address this gap, this paper introduces a probabilistic model that quantifies visual uncertainty for each Gaussian. Leveraging Shannon Mutual Information, we formulate a criterion, Gaussian Splatting Shannon Mutual Information (GauSS-MI), for real-time assessment of visual mutual information from novel viewpoints, facilitating the selection of next best view. GauSS-MI is implemented within an active reconstruction system integrated with a view and motion planner. Extensive experiments across various simulated and real-world scenes showcase the superior visual quality and reconstruction efficiency performance of the proposed system.
Problem

Research questions and friction points this paper is trying to address.

Real-time active view selection for 3D reconstruction
Quantifying visual uncertainty in reconstruction models
Selecting informative viewpoints using Shannon Mutual Information
Innovation

Methods, ideas, or system contributions that make the work stand out.

Probabilistic model quantifies visual uncertainty per Gaussian
GauSS-MI assesses visual mutual information in real-time
Active reconstruction system integrates view and motion planner
🔎 Similar Papers
No similar papers found.
Yuhan Xie
Yuhan Xie
PhD student, EPFL
Deep LearningTime Series AnalysisBrain Computer Interface
Yixi Cai
Yixi Cai
Postdoctoral Fellow, Division of Robotics, Perception and Learning, KTH
RoboticsLiDARMapping
Yinqiang Zhang
Yinqiang Zhang
The University of Hong Kong
Building Information Model3D ReconstructionReinforcement Learning
L
Lei Yang
Faculty of Engineering, The University of Hong Kong, Hong Kong SAR, China
J
Jia Pan
School of Computing and Data Science, The University of Hong Kong, Hong Kong SAR, China; Centre for Transformative Garment Production, Hong Kong SAR, China