NavBench: A Unified Robotics Benchmark for Reinforcement Learning-Based Autonomous Navigation

📅 2025-05-20
🤖 AI Summary
Existing RL navigation benchmarks are often tied to a single platform, which limits generalization and hinders fair comparison across mobility systems and sim-to-real evaluation. NavBench is a unified, multi-domain navigation benchmark that spans terrestrial, aquatic, aerial, and space environments. Built on IsaacLab, it provides a cross-medium task formulation, a modular evaluation framework, and a standardized RL interface, so that heterogeneous platforms (wheeled, thruster-driven, and water-propelled) can be assessed on comparable terms. Policy generalization is further supported by domain randomization and plug-and-play robot templates. Experiments demonstrate sim-to-real transfer to a satellite robotic simulator, an unmanned surface vessel, and a wheeled ground vehicle. The codebase is open source to support community extension and benchmarking.
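
The summary mentions domain randomization without giving details. As a generic illustration of the idea, and not NavBench's implementation, the sketch below resamples a few physical parameters at the start of every episode so that a policy cannot overfit to one fixed simulator configuration. The parameter names and ranges (`mass_scale`, `thrust_scale`, `sensor_noise_std`) are assumptions made up for this example.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class UniformRange:
    """Closed interval from which one parameter is drawn uniformly."""
    low: float
    high: float

    def sample(self, rng: random.Random) -> float:
        return rng.uniform(self.low, self.high)


# Hypothetical parameters and ranges, chosen only for illustration.
RANDOMIZATION = {
    "mass_scale": UniformRange(0.9, 1.1),        # multiplier on nominal mass
    "thrust_scale": UniformRange(0.8, 1.2),      # multiplier on actuator output
    "sensor_noise_std": UniformRange(0.0, 0.05), # std of additive observation noise
}


def sample_episode_params(rng: random.Random) -> dict:
    """Draw one independent value per parameter for a new episode."""
    return {name: r.sample(rng) for name, r in RANDOMIZATION.items()}


# A seeded generator keeps the sampled sequence reproducible across runs.
rng = random.Random(0)
for episode in range(3):
    params = sample_episode_params(rng)
    print(episode, {k: round(v, 3) for k, v in params.items()})
```

In a real training loop the sampled values would be written into the simulator before each reset; how that is done depends on the simulator and is left out here.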

📝 Abstract
Autonomous robots must navigate and operate in diverse environments, from terrestrial and aquatic settings to aerial and space domains. While Reinforcement Learning (RL) has shown promise in training policies for specific autonomous robots, existing benchmarks are often constrained to unique platforms, limiting generalization and fair comparisons across different mobility systems. In this paper, we present NavBench, a multi-domain benchmark for training and evaluating RL-based navigation policies across diverse robotic platforms and operational environments. Built on IsaacLab, our framework standardizes task definitions, enabling different robots to tackle various navigation challenges without the need for ad-hoc task redesigns or custom evaluation metrics. Our benchmark addresses three key challenges: (1) Unified cross-medium benchmarking, enabling direct evaluation of diverse actuation methods (thrusters, wheels, water-based propulsion) in realistic environments; (2) Scalable and modular design, facilitating seamless robot-task interchangeability and reproducible training pipelines; and (3) Robust sim-to-real validation, demonstrated through successful policy transfer to multiple real-world robots, including a satellite robotic simulator, an unmanned surface vessel, and a wheeled ground vehicle. By ensuring consistency between simulation and real-world deployment, NavBench simplifies the development of adaptable RL-based navigation strategies. Its modular design allows researchers to easily integrate custom robots and tasks by following the framework's predefined templates, making it accessible for a wide range of applications. Our code is publicly available at NavBench.
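
The robot-task interchangeability described in the abstract can be pictured with a small, self-contained sketch. It is not NavBench's API and does not use IsaacLab: every class and function name below (`DifferentialDrive`, `TwinThruster`, `GoToPosition`, `placeholder_policy`, `evaluate`) is hypothetical, the kinematics are deliberately simplified to a planar model, and a hand-written proportional controller stands in for a trained RL policy. The point is only that the task and the metrics are defined once and reused unchanged for two different actuation models.

```python
import math
from dataclasses import dataclass


@dataclass
class Pose2D:
    x: float
    y: float
    yaw: float


class DifferentialDrive:
    """Hypothetical wheeled robot: action = (left, right) wheel speeds."""
    def __init__(self, track_width=0.5):
        self.track_width = track_width

    def body_velocity(self, action):
        left, right = action
        return 0.5 * (left + right), (right - left) / self.track_width


class TwinThruster:
    """Hypothetical two-thruster vessel: action = (port, starboard) commands.
    Thrust is mapped straight to velocity by fixed gains (a strong simplification)."""
    def __init__(self, surge_gain=0.8, yaw_gain=1.5):
        self.surge_gain = surge_gain
        self.yaw_gain = yaw_gain

    def body_velocity(self, action):
        port, starboard = action
        return (self.surge_gain * 0.5 * (port + starboard),
                self.yaw_gain * (starboard - port))


class GoToPosition:
    """Task defined only in terms of pose and goal, independent of the robot."""
    def __init__(self, goal, tolerance=0.1):
        self.goal = goal
        self.tolerance = tolerance

    def observe(self, pose):
        dx, dy = self.goal[0] - pose.x, self.goal[1] - pose.y
        bearing = math.atan2(dy, dx) - pose.yaw
        bearing = math.atan2(math.sin(bearing), math.cos(bearing))  # wrap to [-pi, pi]
        return math.hypot(dx, dy), bearing

    def reward(self, pose):
        return -self.observe(pose)[0]  # negative distance to goal

    def done(self, pose):
        return self.observe(pose)[0] < self.tolerance


def placeholder_policy(obs):
    """Stand-in for a trained policy: simple proportional steering."""
    distance, bearing = obs
    forward = min(1.0, distance) * max(0.0, math.cos(bearing))
    turn = max(-1.0, min(1.0, bearing))
    return forward - 0.5 * turn, forward + 0.5 * turn


def evaluate(robot, task, max_steps=2000, dt=0.05):
    """Run one episode and report the same metrics for any robot."""
    pose, total = Pose2D(0.0, 0.0, 0.0), 0.0
    for step in range(max_steps):
        if task.done(pose):
            return {"success": True, "steps": step, "return": total}
        v, w = robot.body_velocity(placeholder_policy(task.observe(pose)))
        pose = Pose2D(pose.x + v * math.cos(pose.yaw) * dt,
                      pose.y + v * math.sin(pose.yaw) * dt,
                      pose.yaw + w * dt)
        total += task.reward(pose)
    return {"success": task.done(pose), "steps": max_steps, "return": total}


task = GoToPosition(goal=(2.0, 1.0))
for robot in (DifferentialDrive(), TwinThruster()):
    print(type(robot).__name__, evaluate(robot, task))
```

Because `evaluate` only relies on `body_velocity`, adding another platform in this sketch means writing one more small class, while the task definition and the reported metrics stay untouched.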

Problem

Research questions and friction points this paper is trying to address.

Unified cross-medium benchmarking for diverse robotic platforms
Scalable modular design enabling robot-task interchangeability
Robust sim-to-real validation for real-world deployment

Innovation

Methods, ideas, or system contributions that make the work stand out.

Unified cross-medium benchmarking for diverse actuation methods
Scalable modular design enabling robot-task interchangeability
Robust sim-to-real validation for real-world policy transfer

Authors

Matteo El-Hariry
Space Robotics Research Group, SnT, University of Luxembourg

Antoine Richard
Nvidia
Robotics, Computer Vision, Control, Machine Learning, Reinforcement Learning

Ricard M. Castan
Space Robotics Research Group, SnT, University of Luxembourg

Luis F. W. Batista
GeorgiaTech Europe - IRL2958 GT-CNRS, Metz, France

Matthieu Geist
Earth Species Project (ex-google, ex-cohere, on leave of Professor, Université de Lorraine)
Reinforcement Learning, Machine Learning

Cédric Pradalier
GeorgiaTech Europe - IRL2958 GT-CNRS, Metz, France

M. Olivares-Mendez
Space Robotics Research Group, SnT, University of Luxembourg