3D Scene Change Modeling With Consistent Multi-View Aggregation

πŸ“… 2025-12-28
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Existing 3D change detection methods suffer from spatial inconsistency and ambiguity in distinguishing pre- and post-change states. To address these issues, we propose SCaR-3D, the first framework leveraging Signed Distance Field (SDF)-guided 2D differential modeling and multi-view consistency voting for object-level, direction-aware change localization. We further introduce a dynamic region selection and update strategy to enable continual scene reconstruction. Additionally, we construct CCS3Dβ€”the first controllable, compositional synthetic dataset for 3D change detection. Our method integrates multi-view 3D Gaussian Splatting (3DGS) reconstruction, SDF-based differential representation, voting-based aggregation, and incremental optimization. On CCS3D, SCaR-3D significantly outperforms state-of-the-art methods: it achieves a +12.6% mIoU gain, 2.3Γ— faster inference, and simultaneously ensures high accuracy, computational efficiency, and structural fidelity.

Technology Category

Application Category

πŸ“ Abstract
Change detection plays a vital role in scene monitoring, exploration, and continual reconstruction. Existing 3D change detection methods often exhibit spatial inconsistency in the detected changes and fail to explicitly separate pre- and post-change states. To address these limitations, we propose SCaR-3D, a novel 3D scene change detection framework that identifies object-level changes from a dense-view pre-change image sequence and sparse-view post-change images. Our approach consists of a signed-distance-based 2D differencing module followed by multi-view aggregation with voting and pruning, leveraging the consistent nature of 3DGS to robustly separate pre- and post-change states. We further develop a continual scene reconstruction strategy that selectively updates dynamic regions while preserving the unchanged areas. We also contribute CCS3D, a challenging synthetic dataset that allows flexible combinations of 3D change types to support controlled evaluations. Extensive experiments demonstrate that our method achieves both high accuracy and efficiency, outperforming existing methods.
Problem

Research questions and friction points this paper is trying to address.

Detects object-level changes from dense and sparse image sequences
Separates pre- and post-change states using consistent multi-view aggregation
Enables continual reconstruction by selectively updating dynamic scene regions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-view aggregation with voting and pruning
Signed-distance-based 2D differencing module
Selective continual reconstruction of dynamic regions
πŸ”Ž Similar Papers
Zirui Zhou
Zirui Zhou
Huawei Technologies Canada
Mathematical OptimizationDesign and Analysis of AlgorithmsMachine Learning
Junfeng Ni
Junfeng Ni
Tsinghua University
Computer Vision3D Reconstruction
S
Shujie Zhang
Tsinghua University
Y
Yixin Chen
State Key Laboratory of General Artificial Intelligence, BIGAI
S
Siyuan Huang
State Key Laboratory of General Artificial Intelligence, BIGAI