MU-GeNeRF: Multi-view Uncertainty-guided Generalizable Neural Radiance Fields for Distractor-aware Scene

📅 2026-04-20
📈 Citations: 0
✨ Influential: 0

🤖 AI Summary
This work addresses the vulnerability of Generalizable Neural Radiance Fields (GeNeRFs) to transient distractors in sparse-view settings, where such distractors break cross-view structural consistency and degrade reconstruction quality. The authors propose a multi-view uncertainty-guided GeNeRF framework that decomposes uncertainty into two parts: structural discrepancies across source views and observation anomalies in the target view. The two are combined in a heteroscedastic reconstruction loss that adaptively modulates the supervision signal according to the estimated uncertainty, which suppresses the influence of transient distractors and improves geometric consistency and robustness. According to the reported experiments, the method outperforms existing generalizable NeRF approaches and achieves performance comparable to distractor-free NeRFs that are optimized per scene.

📝 Abstract
Generalizable Neural Radiance Fields (GeNeRFs) enable high-quality scene reconstruction from sparse views and can generalize to unseen scenes. However, in real-world settings, transient distractors break cross-view structural consistency, corrupting supervision and degrading reconstruction quality. Existing distractor-free NeRF methods rely on per-scene optimization and estimate uncertainty from per-view reconstruction errors, which are not reliable for GeNeRFs and often misjudge inconsistent static structures as distractors. To this end, we propose MU-GeNeRF, a Multi-view Uncertainty-guided distractor-aware GeNeRF framework designed to alleviate GeNeRF's robust modeling challenges in the presence of transient distractions. We decompose distractor awareness into two complementary uncertainty components: Source-view Uncertainty, which captures structural discrepancies across source views caused by viewpoint changes or dynamic factors; and Target-view Uncertainty, which detects observation anomalies in the target image induced by transient distractors. These two uncertainties address distinct error sources and are combined through a heteroscedastic reconstruction loss, which guides the model to adaptively modulate supervision, enabling more robust distractor suppression and geometric modeling. Extensive experiments show that our method not only surpasses existing GeNeRFs but also achieves performance comparable to scene-specific distractor-free NeRFs.
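
To make the idea of a heteroscedastic reconstruction loss concrete, here is a minimal sketch of the generic Gaussian negative log-likelihood form, in which each ray's squared color error is divided by a predicted variance and a log-variance term penalizes inflating that variance everywhere. The abstract does not give the paper's actual formula. The function name, the argument names `sigma_src` and `sigma_tgt`, and the choice to merge the two uncertainties by adding their variances are all assumptions made for illustration, not the authors' implementation.

```python
import math


def heteroscedastic_loss(pred, target, sigma_src, sigma_tgt, eps=1e-6):
    """Generic heteroscedastic (Gaussian NLL) loss averaged over rays.

    pred, target: per-ray RGB triples (rendered vs. observed color).
    sigma_src, sigma_tgt: per-ray standard deviations standing in for
        source-view and target-view uncertainty (hypothetical names).
    ASSUMPTION: the two uncertainties are merged by summing variances;
    the paper may combine them differently.
    """
    total = 0.0
    for p, t, s_src, s_tgt in zip(pred, target, sigma_src, sigma_tgt):
        # Combined per-ray variance; eps keeps the log and division finite.
        var = s_src ** 2 + s_tgt ** 2 + eps
        # Squared color error summed over the RGB channels.
        sq_err = sum((pc - tc) ** 2 for pc, tc in zip(p, t))
        # A large variance down-weights the error; the log term charges for it.
        total += sq_err / (2.0 * var) + 0.5 * math.log(var)
    return total / len(pred)


# Two rays: the first is clean, the second hits a transient distractor.
pred = [(0.5, 0.5, 0.5), (0.5, 0.5, 0.5)]
target = [(0.5, 0.5, 0.5), (1.0, 0.0, 1.0)]

# Same low uncertainty on both rays vs. high target-view uncertainty
# on the corrupted ray only.
uniform = heteroscedastic_loss(pred, target, [0.1, 0.1], [0.1, 0.1])
aware = heteroscedastic_loss(pred, target, [0.1, 0.1], [0.1, 1.0])
print(uniform, aware)
```

In this toy case, assigning high target-view uncertainty to the corrupted ray gives a lower loss than treating both rays as equally reliable, which is the sense in which supervision is modulated by uncertainty.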
Problem

Research questions and friction points this paper is trying to address.

transient distractors
Generalizable Neural Radiance Fields
multi-view consistency
scene reconstruction
uncertainty estimation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-view Uncertainty
Generalizable NeRF
Distractor-aware
Heteroscedastic Loss
Neural Radiance Fields
Wenjie Mu
School of Computer Science and Technology, Tongji University, Shanghai, China
Zhan Li
BASF Digital Farming GmbH
Earth Science, Terrestrial Ecosystem, Forest, Wetland, Remote Sensing
Chuanzhou Su
School of Computer Science and Technology, Tongji University, Shanghai, China
Xuanyi Shen
School of Computer Science and Technology, Tongji University, Shanghai, China
Ziniu Liu
School of Computer Science and Technology, Tongji University, Shanghai, China
Fan Lu
School of Computer Science and Technology, Tongji University, Shanghai, China
Yujian Mo
Tongji University
Too stressed to sleep!!!
Junqiao Zhao
Department of Computer Science and Technology, Tongji University
SLAM, Localization, Reinforcement Learning, Autonomous Driving
Tiantian Feng
Postdoc Researcher
Health and Behaviors, Wearable Computing, Affective Computing, Speech and Biosignal, Responsible ML
Chen Ye
The Key Laboratory of Embedded System and Service Computing, Tongji University, Shanghai, China
Computer Vision, Machine Learning, Applications
Guang Chen
Tongji University
Embodied AI, Machine Vision, Robotics, Autonomous Driving