From Obstacles to Etiquette: Robot Social Navigation with VLM-Informed Path Selection

📅 2026-02-09
🏛️ IEEE Robotics and Automation Letters
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work proposes a socially aware navigation framework that integrates geometric path planning with a lightweight vision-language model (VLM) to address the limitations of conventional robotic navigation methods, which often prioritize obstacle avoidance while neglecting social norms and thereby disrupting human activities. The approach first generates multiple geometrically feasible paths and then employs a VLM—fine-tuned via distillation from a large foundational model on social commonsense—to evaluate and select, in real time, the path most aligned with contextual social conventions. Evaluated across four representative social scenarios, the method significantly outperforms baseline approaches, achieving the shortest duration of personal space violations, minimal time spent facing pedestrians head-on, and complete avoidance of intrusions into designated social zones, thereby enabling efficient and socially compliant navigation.

Technology Category

Application Category

📝 Abstract
Navigating socially in human environments requires more than satisfying geometric constraints, as collision-free paths may still interfere with ongoing activities or conflict with social norms. Addressing this challenge calls for analyzing interactions between agents and incorporating common-sense reasoning into planning. This paper presents a social robot navigation framework that integrates geometric planning with contextual social reasoning. The system first extracts obstacles and human dynamics to generate geometrically feasible candidate paths, then leverages a fine-tuned vision-language model (VLM) to evaluate these paths, informed by contextually grounded social expectations, selecting a socially optimized path for the controller. This task-specific VLM distills social reasoning from large foundation models into a smaller and efficient model, allowing the framework to perform real-time adaptation in diverse human-robot interaction contexts. Experiments in four social navigation contexts demonstrate that our method achieves the best overall performance with the lowest personal space violation duration, the minimal pedestrian-facing time, and no social zone intrusions. Project page: https://path-etiquette.github.io
Problem

Research questions and friction points this paper is trying to address.

social navigation
robot navigation
social norms
personal space
human-robot interaction
Innovation

Methods, ideas, or system contributions that make the work stand out.

social robot navigation
vision-language model
contextual social reasoning
path selection
foundation model distillation
🔎 Similar Papers
2024-04-25The international journal of robotics researchCitations: 3