HccePose(BF): Predicting Front & Back Surfaces to Construct Ultra-Dense 2D-3D Correspondences for Pose Estimation

📅 2025-10-11
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing pose estimation methods predominantly focus on predicting 3D coordinates of the object’s front surface, neglecting the geometric modeling potential of the back surface and interior volume. This work proposes the Hierarchical Continuous Coordinate Encoding (HCCE), the first method to jointly model both front and back surfaces along with a dense volumetric representation between them. HCCE employs a neural network to predict front and back surface depths in tandem, followed by dense 3D sampling within the bounded volume to establish ultra-dense 2D–3D correspondences. The encoding enables high-precision, low-spectral-bias coordinate embedding, and is integrated with an optimized PnP solver to enhance pose robustness. Evaluated on all seven core datasets of the BOP benchmark, our approach achieves new state-of-the-art performance, improving average ADD(-S) accuracy by 2.1–5.7 percentage points—demonstrating the effectiveness and advancement of leveraging full 3D spatial geometry for pose estimation.

Technology Category

Application Category

📝 Abstract
In pose estimation for seen objects, a prevalent pipeline involves using neural networks to predict dense 3D coordinates of the object surface on 2D images, which are then used to establish dense 2D-3D correspondences. However, current methods primarily focus on more efficient encoding techniques to improve the precision of predicted 3D coordinates on the object's front surface, overlooking the potential benefits of incorporating the back surface and interior of the object. To better utilize the full surface and interior of the object, this study predicts 3D coordinates of both the object's front and back surfaces and densely samples 3D coordinates between them. This process creates ultra-dense 2D-3D correspondences, effectively enhancing pose estimation accuracy based on the Perspective-n-Point (PnP) algorithm. Additionally, we propose Hierarchical Continuous Coordinate Encoding (HCCE) to provide a more accurate and efficient representation of front and back surface coordinates. Experimental results show that, compared to existing state-of-the-art (SOTA) methods on the BOP website, the proposed approach outperforms across seven classic BOP core datasets. Code is available at https://github.com/WangYuLin-SEU/HCCEPose.
Problem

Research questions and friction points this paper is trying to address.

Predicting front and back object surfaces for pose estimation
Creating ultra-dense 2D-3D correspondences using interior sampling
Improving pose accuracy through hierarchical coordinate encoding
Innovation

Methods, ideas, or system contributions that make the work stand out.

Predicts front and back surface 3D coordinates
Creates ultra-dense 2D-3D correspondences via sampling
Uses Hierarchical Continuous Coordinate Encoding technique
🔎 Similar Papers
No similar papers found.