🤖 AI Summary
Generative navigation suffers from two fundamental limitations: (1) trajectory predictions lack metric scale, causing spatial misalignment in physical environments; and (2) control policies target only individual waypoints without considering the global path, leading to myopic behavior and collision risks. To address these issues, we propose MetricNet—the first end-to-end framework for metric-scale recovery in generative navigation—which estimates true Euclidean distances between waypoints via deep regression, thereby aligning pixel/feature-space representations with physical coordinates. Building upon this, we introduce MetricNav, a navigation policy that preserves the expressive power of generative modeling while integrating long-horizon path constraints and local obstacle avoidance. Extensive evaluations on both simulation and real-world robotic platforms demonstrate significant improvements in navigation success rate, exploration efficiency, and collision safety. Our approach establishes a measurable, deployable paradigm for generative navigation.
📝 Abstract
Generative navigation policies have made rapid progress in improving end-to-end learned navigation. Despite their promising results, this paradigm has two structural problems. First, the sampled trajectories exist in an abstract, unscaled space without metric grounding. Second, the control strategy discards the full path, instead moving directly towards a single waypoint. This leads to short-sighted and unsafe actions, moving the robot towards obstacles that a complete and correctly scaled path would circumvent. To address these issues, we propose MetricNet, an effective add-on for generative navigation that predicts the metric distance between waypoints, grounding policy outputs in real-world coordinates. We evaluate our method in simulation with a new benchmarking framework and show that executing MetricNet-scaled waypoints significantly improves both navigation and exploration performance. Beyond simulation, we further validate our approach in real-world experiments. Finally, we propose MetricNav, which integrates MetricNet into a navigation policy to guide the robot away from obstacles while still moving towards the goal.