🤖 AI Summary
To address artifacts and perceptual inconsistencies in 3D Gaussian Splatting—stemming from inaccurate specular modeling under complex lighting, insufficient surface property representation, and biased view-dependent effect modeling—this paper proposes a perception-enhanced and geometrically precise framework for structured 3D Gaussians. Our method introduces three key innovations: (1) a Local-Enhanced Multi-head Self-Attention (LEMSA) mechanism replacing spherical harmonics to improve specular and material perception; (2) a Kolmogorov–Arnold Network (KAN) for dynamic optimization of Gaussian opacity and covariance, enhancing geometric fidelity; and (3) a Neural Laplacian Pyramid (NLPD) to enforce cross-view perceptual consistency. Evaluated on multiple benchmarks, our approach achieves significant improvements in specular reflection accuracy, fine-detail preservation, and geometric realism, while effectively suppressing spurious geometry. It comprehensively outperforms state-of-the-art methods.
📝 Abstract
Recent advances in structured 3D Gaussians for view-adaptive rendering, particularly through methods like Scaffold-GS, have demonstrated promising results in neural scene representation. However, existing approaches still face challenges in perceptual consistency and precise view-dependent effects. We present PEP-GS, a novel framework that enhances structured 3D Gaussians through three key innovations: (1) a Local-Enhanced Multi-head Self-Attention (LEMSA) mechanism that replaces spherical harmonics for more accurate view-dependent color decoding, and (2) Kolmogorov-Arnold Networks (KAN) that optimize Gaussian opacity and covariance functions for enhanced interpretability and splatting precision. (3) a Neural Laplacian Pyramid Decomposition (NLPD) that improves perceptual similarity across views. Our comprehensive evaluation across multiple datasets indicates that, compared to the current state-of-the-art methods, these improvements are particularly evident in challenging scenarios such as view-dependent effects, specular reflections, fine-scale details and false geometry generation.