MoRe: Monocular Geometry Refinement via Graph Optimization for Cross-View Consistency

📅 2025-10-08
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Monocular 3D foundation models suffer from scale ambiguity, leading to cross-view geometric inconsistency and inaccurate scaling. To address this, we propose a training-free geometric optimization method. Our approach establishes inter-frame feature correspondences to infer cross-view point matches, approximates local surface geometry via planar priors, and formulates a graph-based optimization framework that explicitly enforces multi-frame geometric consistency constraints. Crucially, we couple graph optimization with local planarity regularization—without altering the original 3D representation or requiring additional training—thereby effectively mitigating scale ambiguity and enhancing geometric fidelity. Experiments demonstrate significant improvements in sparse-view 3D reconstruction accuracy and novel-view synthesis quality, particularly in cross-view consistency and scale alignment.

Technology Category

Application Category

📝 Abstract
Monocular 3D foundation models offer an extensible solution for perception tasks, making them attractive for broader 3D vision applications. In this paper, we propose MoRe, a training-free Monocular Geometry Refinement method designed to improve cross-view consistency and achieve scale alignment. To induce inter-frame relationships, our method employs feature matching between frames to establish correspondences. Rather than applying simple least squares optimization on these matched points, we formulate a graph-based optimization framework that performs local planar approximation using the estimated 3D points and surface normals estimated by monocular foundation models. This formulation addresses the scale ambiguity inherent in monocular geometric priors while preserving the underlying 3D structure. We further demonstrate that MoRe not only enhances 3D reconstruction but also improves novel view synthesis, particularly in sparse view rendering scenarios.
Problem

Research questions and friction points this paper is trying to address.

Refining monocular 3D geometry for cross-view consistency
Addressing scale ambiguity in monocular geometric priors
Improving 3D reconstruction and sparse view synthesis
Innovation

Methods, ideas, or system contributions that make the work stand out.

Graph optimization refines monocular geometry via cross-view consistency
Feature matching establishes inter-frame correspondences for scale alignment
Local planar approximation preserves 3D structure using surface normals
🔎 Similar Papers
No similar papers found.