C-DOG: Training-Free Multi-View Multi-Object Association in Dense Scenes Without Visual Feature via Connected δ-Overlap Graphs

📅 2025-07-18

🤖 AI Summary

Multi-view multi-object association remains a critical challenge in 3D reconstruction, particularly in textureless, high-density scenes with low camera overlap, where appearance-based methods and methods that rely on epipolar constraints alone are not robust enough. This paper proposes C-DOG, a training-free, geometry-driven framework. First, each 2D observation becomes a node in a connected δ-overlap graph whose edges are weighted by epipolar consistency. Second, δ-neighbor-overlap clustering groups strongly consistent nodes into object instances while tolerating noise and partial connectivity. Third, interquartile range (IQR) filtering and a 3D back-projection error criterion remove inconsistent observations. The method uses no appearance features. On synthetic benchmarks it outperforms geometry-based baselines under high object density and limited camera overlap, which the authors argue makes it well suited to scalable real-world 3D reconstruction.

📝 Abstract

Multi-view multi-object association is a fundamental step in 3D reconstruction pipelines, enabling consistent grouping of object instances across multiple camera views. Existing methods often rely on appearance features or geometric constraints such as epipolar consistency. However, these approaches can fail when objects are visually indistinguishable or observations are corrupted by noise. We propose C-DOG, a training-free framework that serves as an intermediate module bridging object detection (or pose estimation) and 3D reconstruction, without relying on visual features. It combines connected delta-overlap graph modeling with epipolar geometry to robustly associate detections across views. Each 2D observation is represented as a graph node, with edges weighted by epipolar consistency. A delta-neighbor-overlap clustering step identifies strongly consistent groups while tolerating noise and partial connectivity. To further improve robustness, we incorporate Interquartile Range (IQR)-based filtering and a 3D back-projection error criterion to eliminate inconsistent observations. Extensive experiments on synthetic benchmarks demonstrate that C-DOG outperforms geometry-based baselines and remains robust under challenging conditions, including high object density, the absence of visual features, and limited camera overlap, making it well-suited for scalable 3D reconstruction in real-world scenarios.
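
The graph idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not define the δ-neighbor-overlap clustering rule, so plain connected components stand in for it here, and the function names, the `max_dist` threshold, and the detection format are all assumptions made for illustration. Edge consistency is measured with the standard symmetric point-to-epipolar-line distance under a known fundamental matrix.

```python
from itertools import combinations
from math import hypot, inf


def epipolar_line(F, x):
    """Line F @ [u, v, 1] as (a, b, c), for a 3x3 matrix given as nested lists."""
    u, v = x
    return tuple(row[0] * u + row[1] * v + row[2] for row in F)


def point_line_distance(line, x):
    """Perpendicular pixel distance from point x to the line a*u + b*v + c = 0."""
    a, b, c = line
    norm = hypot(a, b)
    return abs(a * x[0] + b * x[1] + c) / norm if norm > 0 else inf


def symmetric_epipolar_distance(F, x1, x2):
    """Mean of both point-to-epipolar-line distances, where x2^T F x1 = 0 ideally."""
    Ft = [list(col) for col in zip(*F)]
    d2 = point_line_distance(epipolar_line(F, x1), x2)   # x2 against line of x1
    d1 = point_line_distance(epipolar_line(Ft, x2), x1)  # x1 against line of x2
    return 0.5 * (d1 + d2)


def associate(detections, fundamentals, max_dist=2.0):
    """Group detections by connected components of the epipolar-consistency graph.

    detections:   list of (view_id, (u, v)) tuples (hypothetical format).
    fundamentals: dict mapping (i, j) with i < j to F such that x_j^T F x_i = 0.
    max_dist:     hypothetical pixel threshold for keeping an edge.
    """
    parent = list(range(len(detections)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for a, b in combinations(range(len(detections)), 2):
        (va, xa), (vb, xb) = detections[a], detections[b]
        if va == vb:
            continue  # two detections in one view cannot be the same object
        if va > vb:
            (va, xa), (vb, xb) = (vb, xb), (va, xa)
        F = fundamentals.get((va, vb))
        if F is not None and symmetric_epipolar_distance(F, xa, xb) <= max_dist:
            parent[find(a)] = find(b)

    groups = {}
    for i in range(len(detections)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


if __name__ == "__main__":
    # Pure x-translation with identity intrinsics: matches share the same row.
    F01 = [[0.0, 0.0, 0.0], [0.0, 0.0, -1.0], [0.0, 1.0, 0.0]]
    dets = [(0, (10.0, 5.0)), (0, (12.0, 40.0)), (1, (3.0, 5.5)), (1, (7.0, 39.0))]
    print(associate(dets, {(0, 1): F01}))  # [[0, 2], [1, 3]]
```

With only a hard threshold and connected components, distinct objects that happen to lie near the same epipolar line would be merged. That ambiguity is what the clustering and filtering steps described in the abstract are meant to resolve.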

Problem

Research questions and friction points this paper is trying to address.

Associating visually indistinguishable objects across multiple camera views

Handling noisy observations without relying on visual features

Improving robustness in dense scenes with limited camera overlap

Innovation

Methods, ideas, or system contributions that make the work stand out.

Training-free multi-object association framework

Connected delta-overlap graph modeling

IQR-based filtering for robust grouping
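
As a rough illustration of IQR-based filtering, the sketch below keeps only values inside the conventional Tukey fences. The source does not say which quantity is filtered or which multiplier is used, so the per-observation error values, the `k=1.5` default, and the function name `iqr_inliers` are assumptions.

```python
from statistics import quantiles


def iqr_inliers(values, k=1.5):
    """Return indices of values inside [Q1 - k*IQR, Q3 + k*IQR].

    values: per-observation scores, e.g. hypothetical reprojection errors.
    k:      fence multiplier; 1.5 is the common convention, not a value
            taken from the paper.
    """
    if len(values) < 4:
        # Too few samples to estimate quartiles meaningfully; keep everything.
        return list(range(len(values)))
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    spread = q3 - q1
    lo, hi = q1 - k * spread, q3 + k * spread
    return [i for i, v in enumerate(values) if lo <= v <= hi]


if __name__ == "__main__":
    errors = [1.0, 1.2, 0.9, 1.1, 1.0, 9.0]  # last observation is an outlier
    print(iqr_inliers(errors))  # [0, 1, 2, 3, 4]
```

Because the fences come from quartiles, a single gross outlier barely moves them, which is the usual reason to prefer this over a mean and standard deviation cutoff.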
