🤖 AI Summary
This work addresses the degradation in clustering accuracy and loss of discriminative power in the binary Bag-of-Words (BoW) model used in traditional ORB-SLAM, which stems from premature binarization that compromises both the distinctiveness and structural integrity of visual vocabulary, thereby impairing loop closure detection and relocalization performance. To mitigate this issue, the authors propose the HBRB-BoW algorithm, which integrates a global real-valued information flow throughout the hierarchical clustering process and defers final binarization exclusively to the leaf nodes. The approach further incorporates an enhanced hierarchical BRB-KMeans clustering scheme and an optimized ORB descriptor strategy. This design effectively alleviates the loss of fine-grained feature details, substantially improving the representational capacity of the BoW model under challenging environmental conditions and consequently enhancing the robustness of loop closure detection and relocalization in ORB-SLAM.
📝 Abstract
In visual simultaneous localization and mapping (SLAM), the quality of the visual vocabulary is fundamental to the system's ability to represent environments and recognize locations. While ORB-SLAM is a widely used framework, its binary vocabulary, trained through the k-majority-based bag-of-words (BoW) approach, suffers from inherent precision loss. The inability of conventional binary clustering to represent subtle feature distributions leads to the degradation of visual words, a problem that is compounded as errors accumulate and propagate through the hierarchical tree structure. To address these structural deficiencies, this paper proposes hierarchical binary-to-real-and-back (HBRB)-BoW, a refined hierarchical binary vocabulary training algorithm. By integrating a global real-valued flow within the hierarchical clustering process, our method preserves high-fidelity descriptor information until the final binarization at the leaf nodes. Experimental results demonstrate that the proposed approach yields a more discriminative and well-structured vocabulary than traditional methods, significantly enhancing the representational integrity of the visual dictionary in complex environments. Furthermore, replacing the default ORB-SLAM vocabulary file with our HBRB-BoW file is expected to improve performance in loop closing and relocalization tasks.