🤖 AI Summary
Traditional navigation methods face limitations in complex real-world environments: map-free reactive approaches lack robustness; mapping-based methods incur high computational overhead; and learning-based methods suffer from poor generalization and heavy reliance on large-scale training data.
Method: This paper proposes a lightweight, map-free real-time visual navigation framework. It introduces Pareto-optimal decision-making into visual navigation for the first time, integrating CNN-based semantic feature extraction, multi-objective optimization (balancing safety and efficiency), and closed-loop visual servo control grounded in the image Jacobian matrix.
Contribution/Results: Its core innovation lies in a semantics-driven, unified design for image-space decision-making and control. Experiments demonstrate over 30% improvement in navigation success rate across diverse cluttered environments, while achieving strong generalization, real-time performance (>30 FPS), and low computational cost.
📝 Abstract
Navigating complex real-world environments requires semantic understanding and adaptive decision-making. Traditional reactive methods without maps often fail in cluttered settings, map-based approaches demand heavy mapping effort, and learning-based solutions rely on large datasets with limited generalization. To address these challenges, we present Pareto-Optimal Visual Navigation, a lightweight image-space framework that combines data-driven semantics, Pareto-optimal decision-making, and visual servoing for real-time navigation.