Mobile Robot Navigation Using Hand-Drawn Maps: A Vision Language Model Approach

📅 2025-01-31
📈 Citations: 1
Influential: 0
📄 PDF
🤖 AI Summary
To address navigation failures in robots caused by inaccuracies in hand-drawn maps—such as scale distortion and missing landmarks—this paper proposes a robust autonomous navigation method tailored for hand-drawn maps. The method integrates pre-trained vision-language models (VLMs), topological map representations, and predictive navigation planning. Its core contributions are: (1) a novel Selective Visual Association Prompting mechanism that enables cross-modal topological localization and path planning; and (2) a Predictive Navigation Plan Parser that automatically infers and restores missing landmarks using semantic and spatial priors. Evaluated in both rasterized simulation and real-world user studies, the approach significantly improves navigation success rate and Success weighted by Path Length (SPL). Results demonstrate strong generalization across diverse drawing styles, robot morphologies, and environmental scenarios, as well as cross-platform robustness.

Technology Category

Application Category

📝 Abstract
Hand-drawn maps can be used to convey navigation instructions between humans and robots in a natural and efficient manner. However, these maps can often contain inaccuracies such as scale distortions and missing landmarks which present challenges for mobile robot navigation. This paper introduces a novel Hand-drawn Map Navigation (HAM-Nav) architecture that leverages pre-trained vision language models (VLMs) for robot navigation across diverse environments, hand-drawing styles, and robot embodiments, even in the presence of map inaccuracies. HAM-Nav integrates a unique Selective Visual Association Prompting approach for topological map-based position estimation and navigation planning as well as a Predictive Navigation Plan Parser to infer missing landmarks. Extensive experiments were conducted in photorealistic simulated environments, using both wheeled and legged robots, demonstrating the effectiveness of HAM-Nav in terms of navigation success rates and Success weighted by Path Length. Furthermore, a user study in real-world environments highlighted the practical utility of hand-drawn maps for robot navigation as well as successful navigation outcomes.
Problem

Research questions and friction points this paper is trying to address.

Robot Navigation
Hand-drawn Maps
Map Errors
Innovation

Methods, ideas, or system contributions that make the work stand out.

HAM-Nav
Adaptive Navigation
Robotics Mapping
🔎 Similar Papers
No similar papers found.
Aaron Hao Tan
Aaron Hao Tan
Stanford University
Robotics
Angus Fung
Angus Fung
University of Toronto
Haitong Wang
Haitong Wang
University of Toronto
RoboticsReinforcement LearningImitation Learning
G
G. Nejat
Autonomous Systems and Biomechatronics Laboratory (ASBLab), Department of Mechanical and Industrial Engineering, University of Toronto, Toronto, ON M5S 3G8, Canada