A View-consistent Sampling Method for Regularized Training of Neural Radiance Fields

๐Ÿ“… 2025-07-06
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
NeRF training on real-world outdoor scenes suffers from instability due to inaccurate depth estimation, while existing depth regularization methods rely on costly 3D supervision and exhibit poor generalization. To address this, we propose a view-consistent implicit regularization framework: instead of enforcing fixed depth values, we construct a probabilistic view-consistency distribution over ray sampling points via multi-view 2D pixel projections, and introduceโ€” for the first timeโ€”a depth-pushing loss to suppress spurious geometric structures, eliminating dependence on precise depth labels. Our method jointly learns distribution modeling and geometric optimization by fusing high-level semantic features from foundation models with low-level color features. Evaluated on multiple public benchmarks, our approach significantly outperforms state-of-the-art NeRF variants and depth-regularized methods, achieving substantial improvements in novel-view synthesis quality and reconstruction robustness.

Technology Category

Application Category

๐Ÿ“ Abstract
Neural Radiance Fields (NeRF) has emerged as a compelling framework for scene representation and 3D recovery. To improve its performance on real-world data, depth regularizations have proven to be the most effective ones. However, depth estimation models not only require expensive 3D supervision in training, but also suffer from generalization issues. As a result, the depth estimations can be erroneous in practice, especially for outdoor unbounded scenes. In this paper, we propose to employ view-consistent distributions instead of fixed depth value estimations to regularize NeRF training. Specifically, the distribution is computed by utilizing both low-level color features and high-level distilled features from foundation models at the projected 2D pixel-locations from per-ray sampled 3D points. By sampling from the view-consistency distributions, an implicit regularization is imposed on the training of NeRF. We also utilize a depth-pushing loss that works in conjunction with the sampling technique to jointly provide effective regularizations for eliminating the failure modes. Extensive experiments conducted on various scenes from public datasets demonstrate that our proposed method can generate significantly better novel view synthesis results than state-of-the-art NeRF variants as well as different depth regularization methods.
Problem

Research questions and friction points this paper is trying to address.

Improving NeRF performance with depth regularization
Addressing depth estimation errors in outdoor scenes
Using view-consistent distributions for NeRF training
Innovation

Methods, ideas, or system contributions that make the work stand out.

View-consistent distributions replace fixed depth values
Utilizes color and distilled foundation model features
Depth-pushing loss combined with sampling for regularization
๐Ÿ”Ž Similar Papers
No similar papers found.
Aoxiang Fan
Aoxiang Fan
EPFL
computer visionmachine learning
C
Corentin Dumery
Computer Vision Laboratory, EPFL, Switzerland
N
Nicolas Talabot
Computer Vision Laboratory, EPFL, Switzerland
Pascal Fua
Pascal Fua
Professor Computer Science, EPFL
Computer VisionMachine LearningComputer Asisted Eng.Biomedical Imaging