RegFormer++: An Efficient Large-Scale 3D LiDAR Point Registration Network with Projection-Aware 2D Transformer

📅 2026-03-15

📈 Citations: 0

✨ Influential: 0

career value

187K/year

🤖 AI Summary

This work addresses the challenges of accuracy and efficiency in large-scale outdoor LiDAR point cloud registration, which arise from the massive number of points, complex spatial distributions, and abundant outliers. To this end, we propose an end-to-end differentiable Transformer-based registration network. Our method first converts 3D point clouds into a 2D structure via cylindrical projection and introduces a novel projection-aware 2D Transformer with linear complexity for efficient global feature extraction. Furthermore, we integrate a bijective correspondence Transformer and an optimal transport module for robust bidirectional point matching. Extensive experiments on the KITTI, NuScenes, and Argoverse datasets demonstrate that our approach significantly outperforms existing methods in both registration accuracy and computational efficiency.

Technology Category

Application Category

📝 Abstract

Although point cloud registration has achieved remarkable advances in object-level and indoor scenes, large-scale LiDAR registration methods has been rarely explored before. Challenges mainly arise from the huge point scale, complex point distribution, and numerous outliers within outdoor LiDAR scans. In addition, most existing registration works generally adopt a two-stage paradigm: They first find correspondences by extracting discriminative local descriptors and then leverage robust estimators (e.g. RANSAC) to filter outliers, which are highly dependent on well-designed descriptors and post-processing choices. To address these problems, we propose a novel end-to-end differential transformer network, termed RegFormer++, for large-scale point cloud alignment without requiring any further post-processing. Specifically, a hierarchical projection-aware 2D transformer with linear complexity is proposed to project raw LiDAR points onto a cylindrical surface and extract global point features, which can improve resilience to outliers due to long-range dependencies. Because we fill original 3D coordinates into 2D projected positions, our designed transformer can benefit from both high efficiency in 2D processing and accuracy from 3D geometric information. Furthermore, to effectively reduce wrong point matching, a Bijective Association Transformer (BAT) is designed, combining both cross attention and all-to-all point gathering. To improve training stability and robustness, a feature-transformed optimal transport module is also designed for regressing the final pose transformation. Extensive experiments on KITTI, NuScenes, and Argoverse datasets demonstrate that our model achieves state-of-the-art performance in terms of both accuracy and efficiency.

Problem

Research questions and friction points this paper is trying to address.

large-scale LiDAR registration

point cloud registration

outdoor point clouds

outlier resilience

correspondence estimation

Innovation

Methods, ideas, or system contributions that make the work stand out.

projection-aware transformer

large-scale LiDAR registration

bijective association