THE-Pose: Topological Prior with Hybrid Graph Fusion for Estimating Category-Level 6D Object Pose

📅 2025-12-10

📈 Citations: 0

✨ Influential: 0

career value

158K/year

🤖 AI Summary

To address the insufficient robustness in category-level 6D object pose estimation—caused by intra-class shape deformation, occlusion, and visual ambiguity—this paper introduces, for the first time, surface embeddings to explicitly encode topological priors. We propose an Adaptive Hybrid Graph Fusion (HGF) mechanism that jointly models 2D semantic consistency and 3D geometric structure. Our method integrates surface topology embeddings, image features, point cloud geometric encodings, and a learnable graph fusion module to enable cross-modal structured reasoning. Evaluated on the REAL275 benchmark, our approach achieves a 35.8% improvement over the 3D-GC baseline HS-Pose and surpasses prior state-of-the-art methods by 7.2% in average precision. It demonstrates significantly enhanced generalization capability and pose estimation accuracy under challenging real-world conditions, including severe occlusion and large intra-class variations.

Technology Category

Application Category

📝 Abstract

Category-level object pose estimation requires both global context and local structure to ensure robustness against intra-class variations. However, 3D graph convolution (3D-GC) methods only focus on local geometry and depth information, making them vulnerable to complex objects and visual ambiguities. To address this, we present THE-Pose, a novel category-level 6D pose estimation framework that leverages a topological prior via surface embedding and hybrid graph fusion. Specifically, we extract consistent and invariant topological features from the image domain, effectively overcoming the limitations inherent in existing 3D-GC based methods. Our Hybrid Graph Fusion (HGF) module adaptively integrates the topological features with point-cloud features, seamlessly bridging 2D image context and 3D geometric structure. These fused features ensure stability for unseen or complicated objects, even under significant occlusions. Extensive experiments on the REAL275 dataset show that THE-Pose achieves a 35.8% improvement over the 3D-GC baseline (HS-Pose) and surpasses the previous state-of-the-art by 7.2% across all key metrics. The code is avaialbe on https://github.com/EHxxx/THE-Pose

Problem

Research questions and friction points this paper is trying to address.

Estimates 6D object poses for categories with intra-class variations

Overcomes limitations of 3D graph convolution methods using topological priors

Integrates 2D image context with 3D geometry via hybrid graph fusion

Innovation

Methods, ideas, or system contributions that make the work stand out.

Leverages topological prior via surface embedding and hybrid graph fusion

Integrates topological features with point-cloud features adaptively

Bridges 2D image context and 3D geometric structure seamlessly

🔎 Similar Papers

No similar papers found.