🤖 AI Summary
This work addresses the challenge of high-quality, real-time 3D reconstruction and appearance control from unconstrained images—where camera poses are unknown and lighting conditions vary significantly—by proposing a feed-forward 3D Gaussian Splatting model. The method jointly learns 3D Gaussians and image-conditioned appearance embeddings, enabling end-to-end sparse-view reconstruction and flexible appearance modulation without requiring known poses or consistent illumination. As the first approach to support end-to-end 3D Gaussian Splatting under pose-free, multi-illumination settings, it substantially outperforms existing pose-free 3DGS methods on real-world datasets, achieving reconstruction in under one second while enabling diverse appearance editing capabilities.
📝 Abstract
We propose WildSplatter, a feed-forward 3D Gaussian Splatting (3DGS) model for unconstrained images with unknown camera parameters and varying lighting conditions. 3DGS is an effective scene representation that enables high-quality, real-time rendering; however, it typically requires iterative optimization and multi-view images captured under consistent lighting with known camera parameters. WildSplatter is trained on unconstrained photo collections and jointly learns 3D Gaussians and appearance embeddings conditioned on input images. This design enables flexible modulation of Gaussian colors to represent significant variations in lighting and appearance. Our method reconstructs 3D Gaussians from sparse input views in under one second, while also enabling appearance control under diverse lighting conditions. Experimental results demonstrate that our approach outperforms existing pose-free 3DGS methods on challenging real-world datasets with varying illumination.