Topological Mapping and Navigation using a Monocular Camera based on AnyLoc

πŸ“… 2025-08-17
πŸ›οΈ 2025 IEEE 21st International Conference on Automation Science and Engineering (CASE)
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work proposes a lightweight, monocular vision-based approach for topological mapping and navigation that operates without requiring metrically accurate maps or pretrained models. Leveraging AnyLoc, the method converts keyframes into global descriptors to construct a topological graph where nodes represent environmental locations. Navigation decisions are made by matching segmented images to target nodes in the graph. To the best of our knowledge, this is the first application of AnyLoc to monocular topological navigation, significantly enhancing the system’s generalization capability and computational efficiency. Experimental results demonstrate effective loop closure and navigation performance in both real-world and simulated environments, achieving a 60.2% higher average success rate compared to a ResNet-based baseline while substantially reducing both time and memory overhead.

Technology Category

Application Category

πŸ“ Abstract
This paper proposes a method for topological mapping and navigation using a monocular camera. Based on AnyLoc, keyframes are converted into descriptors to construct topological relationships, enabling loop detection and map building. Unlike metric maps, topological maps simplify path planning and navigation by representing environments with key nodes instead of precise coordinates. Actions for visual navigation are determined by comparing segmented images with the image associated with target nodes. The system relies solely on a monocular camera, ensuring fast map building and navigation using key nodes. Experiments show effective loop detection and navigation in real and simulation environments without pre-training. Compared to a ResNet-based method, this approach improves success rates by 63.8% on average while reducing time and space costs, offering a lightweight solution for robot and human navigation in various scenarios.
Problem

Research questions and friction points this paper is trying to address.

topological mapping
monocular camera
visual navigation
loop detection
robot navigation
Innovation

Methods, ideas, or system contributions that make the work stand out.

topological mapping
monocular navigation
AnyLoc
loop detection
lightweight visual navigation
πŸ”Ž Similar Papers
No similar papers found.
Wenzheng Zhang
Wenzheng Zhang
Rutgers University
Natural Language ProcessingDeep Learning
Y
Yoshitaka Hara
Future Robotics Technology Center (fuRo), Chiba Institute of Technology, Chiba, Japan
S
Sousuke Nakamura
Faculty of Science and Engineering, Hosei University, Tokyo, Japan