PlanaReLoc: Camera Relocalization in 3D Planar Primitives via Region-Based Structure Matching

📅 2026-03-21
🤖 AI Summary
This work proposes a lightweight, texture- and pose-prior-free 6-DoF camera relocalization method tailored for structured indoor environments. It introduces 3D planar primitives as region-level structural semantic representations and establishes cross-modal correspondences between query images and a sparse plane-based map through a deep matcher operating in a unified embedding space. Camera poses are then recovered via robust optimization, eliminating the need for photorealistic textures, initial pose estimates, or scene-specific training. Evaluated on multiple benchmarks including ScanNet and 12Scenes, the framework achieves high accuracy and efficiency, demonstrating the effectiveness and generalizability of a structure-primitive-based relocalization paradigm.
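
To make the matching step concrete, here is a minimal sketch of how correspondences could be read off a shared embedding space, assuming each planar primitive on the query side and the map side has already been encoded as a vector. The mutual-nearest-neighbour rule, the cosine similarity, the `min_sim` threshold, and all function names are illustrative assumptions, not the paper's learned matcher.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na > 0 and nb > 0 else 0.0


def mutual_nn_matches(query_emb, map_emb, min_sim=0.5):
    """Return (query_idx, map_idx) pairs that are each other's best match.

    query_emb, map_emb: lists of embedding vectors (hypothetical inputs).
    min_sim: assumed similarity threshold below which a pair is rejected.
    """
    if not query_emb or not map_emb:
        return []
    # Dense similarity table: rows are query primitives, columns are map primitives.
    sim = [[cosine(q, m) for m in map_emb] for q in query_emb]
    best_for_query = [max(range(len(map_emb)), key=lambda j: row[j]) for row in sim]
    best_for_map = [
        max(range(len(query_emb)), key=lambda i: sim[i][j])
        for j in range(len(map_emb))
    ]
    # Keep a pair only when both sides pick each other and the score is high enough.
    return [
        (i, j)
        for i, j in enumerate(best_for_query)
        if best_for_map[j] == i and sim[i][j] >= min_sim
    ]
```

A learned matcher would replace the hand-written similarity rule; the sketch only shows what "correspondences in a unified embedding space" can mean operationally.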

📝 Abstract
While structure-based relocalizers have long strived for point correspondences when establishing or regressing query-map associations, in this paper, we pioneer the use of planar primitives and 3D planar maps for lightweight 6-DoF camera relocalization in structured environments. Planar primitives, beyond being fundamental entities in projective geometry, also serve as region-based representations that encapsulate both structural and semantic richness. This motivates us to introduce PlanaReLoc, a streamlined plane-centric paradigm where a deep matcher associates planar primitives across the query image and the map within a learned unified embedding space, after which the 6-DoF pose is solved and refined under a robust framework. Through comprehensive experiments on the ScanNet and 12Scenes datasets across hundreds of scenes, our method demonstrates the superiority of planar primitives in facilitating reliable cross-modal structural correspondences and achieving effective camera relocalization without requiring realistically textured/colored maps, pose priors, or per-scene training. The code and data are available at https://github.com/3dv-casia/PlanaReLoc .
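
As a rough illustration of how a 6-DoF pose can follow from plane correspondences, the sketch below solves the textbook closed-form problem with NumPy. It assumes each matched query plane comes with estimated 3D parameters in the camera frame (the abstract does not say how the query planes are parameterized), that planes are written as n · X + d = 0 with unit normals, and that at least three matches have linearly independent normals. The function name `pose_from_planes` is hypothetical. This is a plain least-squares solve, not the robust solving and refinement the paper describes.

```python
import numpy as np


def pose_from_planes(n_map, d_map, n_cam, d_cam):
    """Estimate (R, t) with X_cam = R @ X_map + t from matched planes.

    n_map, n_cam: (N, 3) unit normals in the map and camera frames.
    d_map, d_cam: (N,) offsets, so each plane satisfies n . X + d = 0.
    """
    n_map = np.asarray(n_map, dtype=float)
    n_cam = np.asarray(n_cam, dtype=float)
    d_map = np.asarray(d_map, dtype=float)
    d_cam = np.asarray(d_cam, dtype=float)

    # Rotation: Kabsch alignment of the normals, n_cam ~= R @ n_map.
    H = n_map.T @ n_cam
    U, _, Vt = np.linalg.svd(H)
    sign = 1.0 if np.linalg.det(Vt.T @ U.T) >= 0 else -1.0  # avoid reflections
    R = Vt.T @ np.diag([1.0, 1.0, sign]) @ U.T

    # Translation: substituting X_map = R.T @ (X_cam - t) into the map plane
    # equation gives n_cam . t = d_map - d_cam, one linear equation per plane.
    t, *_ = np.linalg.lstsq(n_cam, d_map - d_cam, rcond=None)
    return R, t
```

In practice, wrong matches would need to be rejected first (for example with a RANSAC-style loop around this solver), which the sketch leaves out.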
Problem

Research questions and friction points this paper is trying to address.

camera relocalization
6-DoF pose estimation
planar primitives
structure matching
3D mapping
Innovation

Methods, ideas, or system contributions that make the work stand out.

planar primitives
camera relocalization
structure matching
6-DoF pose estimation
region-based representation
Hanqiao Ye
School of Artificial Intelligence, University of Chinese Academy of Sciences; Institute of Automation, Chinese Academy of Sciences
Yuzhou Liu
Amazon
audio processing, speech processing, machine learning
Yangdong Liu
Institute of Automation, Chinese Academy of Sciences
Shuhan Shen
Institute of Automation, Chinese Academy of Sciences
3D Computer Vision, Photogrammetry, 3D Modeling