BOP-Distrib: Revisiting 6D Pose Estimation Benchmarks for Better Evaluation under Visual Ambiguities

📅 2024-08-30

📈 Citations: 1

✨ Influential: 0

career value

223K/year

🤖 AI Summary

Current 6D pose estimation benchmarks oversimplify visual ambiguities—such as symmetry and occlusion—as global object symmetries, neglecting image-level, viewpoint-dependent visibility variations, thereby misrepresenting real-world pose uncertainty. To address this, we propose a novel image-level pose distribution evaluation paradigm: (1) the first automatic pose distribution annotation method grounded in single-image surface visibility; (2) BOP-Dist, the first pose distribution benchmark tailored to realistic images; and (3) a symmetry-aware sampling strategy coupled with a distribution-aware accuracy/recall evaluation framework. After re-annotating all BOP datasets with pose distributions, we observe substantial corrections to the performance ranking of state-of-the-art single-solution methods—revealing their rankings to be highly sensitive to annotation granularity. This work establishes a physically interpretable, reproducible, and quantitative evaluation standard for multi-solution pose estimation.

Technology Category

Application Category

📝 Abstract

6D pose estimation aims at determining the object pose that best explains the camera observation. The unique solution for non-ambiguous objects can turn into a multi-modal pose distribution for symmetrical objects or when occlusions of symmetry-breaking elements happen, depending on the viewpoint. Currently, 6D pose estimation methods are benchmarked on datasets that consider, for their ground truth annotations, visual ambiguities as only related to global object symmetries, whereas they should be defined per-image to account for the camera viewpoint. We thus first propose an automatic method to re-annotate those datasets with a 6D pose distribution specific to each image, taking into account the object surface visibility in the image to correctly determine the visual ambiguities. Second, given this improved ground truth, we re-evaluate the state-of-the-art single pose methods and show that this greatly modifies the ranking of these methods. Third, as some recent works focus on estimating the complete set of solutions, we derive a precision/recall formulation to evaluate them against our image-wise distribution ground truth, making it the first benchmark for pose distribution methods on real images.

Problem

Research questions and friction points this paper is trying to address.

Improving 6D pose estimation benchmarks for visual ambiguities

Re-annotating datasets with image-specific 6D pose distributions

Evaluating pose distribution methods with precision/recall metrics

Innovation

Methods, ideas, or system contributions that make the work stand out.

Automatic re-annotation with 6D pose distribution

Re-evaluation of state-of-the-art single pose methods

Precision/recall benchmark for pose distribution methods

🔎 Similar Papers

OmniPose6D: Towards Short-Term Object Pose Tracking in Dynamic Scenes from Monocular RGB