NerfBaselines: Consistent and Reproducible Evaluation of Novel View Synthesis Methods

📅 2024-06-25
🏛️ arXiv.org
📈 Citations: 16
Influential: 0
📄 PDF

career value

183K/year
🤖 AI Summary
New view synthesis evaluation suffers from three major challenges: inconsistent evaluation protocols, poor reproducibility, and limited cross-scene generalizability—leading to unreliable state-of-the-art tracking and questionable quantitative comparability. To address these issues, we propose the first standardized evaluation framework for neural radiance fields (NeRF) and 3D Gaussian splatting (3DGS). Our framework (1) establishes a cross-method consistent evaluation protocol; (2) enables one-click reproduction of multiple algorithms with CI-driven automated result validation; and (3) provides an interactive web platform for transparent, horizontal benchmarking across eight+ methods on five+ standard scenes. Implemented in Python/PyTorch, it unifies data loading, rendering, and evaluation pipelines. Experiments reproduce over ten mainstream methods and significantly reduce evaluation variance across multiple datasets, establishing a robust, reproducible benchmark for new view synthesis research.

Technology Category

Application Category

📝 Abstract
Novel view synthesis is an important problem with many applications, including AR/VR, gaming, and simulations for robotics. With the recent rapid development of Neural Radiance Fields (NeRFs) and 3D Gaussian Splatting (3DGS) methods, it is becoming difficult to keep track of the current state of the art (SoTA) due to methods using different evaluation protocols, codebases being difficult to install and use, and methods not generalizing well to novel 3D scenes. Our experiments support this claim by showing that tiny differences in evaluation protocols of various methods can lead to inconsistent reported metrics. To address these issues, we propose a framework called NerfBaselines, which simplifies the installation of various methods, provides consistent benchmarking tools, and ensures reproducibility. We validate our implementation experimentally by reproducing numbers reported in the original papers. To further improve the accessibility, we release a web platform where commonly used methods are compared on standard benchmarks. Web: https://jkulhanek.com/nerfbaselines
Problem

Research questions and friction points this paper is trying to address.

Addressing inconsistent evaluation protocols in novel view synthesis
Ensuring reproducibility and comparability of quantitative results
Simplifying installation and usage of diverse NVS methods
Innovation

Methods, ideas, or system contributions that make the work stand out.

Framework provides consistent benchmarking tools
Ensures reproducibility and simplifies method usage
Releases web platform for method comparison
🔎 Similar Papers