🤖 AI Summary
This paper addresses the geometric and semantic misalignment between LiDAR point clouds and BIM models—arising from heterogeneous data sources and inconsistent coordinate systems—by proposing a cross-modal precise registration method. Methodologically, it introduces a novel triangular mesh descriptor based on wall and corner extraction, integrated with a hierarchical Hough voting scheme and occupancy-aware scoring validation to achieve scale-invariant matching and robust multi-hypothesis pose estimation. Evaluated on large-scale campus buildings across multiple LiDAR scanning sessions, the method achieves sub-decimeter (<0.1 m) global coordinate system registration accuracy for both terrestrial and mobile LiDAR data. The paper also releases the first benchmark dataset and open-source implementation specifically designed for LiDAR–BIM registration, establishing a reproducible technical pipeline and practical toolkit for cross-modal collaborative modeling.
📝 Abstract
Light detection and ranging (LiDAR) point clouds and building information modeling (BIM) represent two distinct data modalities in the fields of robot perception and construction. These modalities originate from different sources and are associated with unique reference frames. The primary goal of this study is to align these modalities within a shared reference frame using a global registration approach, effectively enabling them to ``speak the same language''. To achieve this, we propose a cross-modality registration method, spanning from the front end to the back end. At the front end, we extract triangle descriptors by identifying walls and intersected corners, enabling the matching of corner triplets with a complexity independent of the BIM's size. For the back-end transformation estimation, we utilize the Hough transform to map the matched triplets to the transformation space and introduce a hierarchical voting mechanism to hypothesize multiple pose candidates. The final transformation is then verified using our designed occupancy-aware scoring method. To assess the effectiveness of our approach, we conducted real-world multi-session experiments in a large-scale university building, employing two different types of LiDAR sensors. We make the collected datasets and codes publicly available to benefit the community.