Towards Data-Driven Metrics for Social Robot Navigation Benchmarking

📅 2025-09-01
🤖 AI Summary
Current social robot navigation evaluation relies heavily on hand-crafted rules and lacks quantifiable, standardized benchmarks. To address this, we work towards a data-driven metric for assessing social navigation quality. We compile a dataset of 4,427 navigation trajectories (182 real and 4,245 simulated) and present it to human raters, which yields 4,402 rated trajectories after data quality assurance. We then train a supervised RNN-based baseline metric using the human ratings as ground-truth labels and report quantitative and qualitative results. Our key contributions are: (1) a proposal for storing rated social navigation trajectory datasets, together with a human-rated dataset compiled following it; (2) an RNN-based baseline metric intended to facilitate benchmarking and navigation policy optimization; and (3) public release of all data, software, and trained model weights.

📝 Abstract
This paper presents a joint effort towards the development of a data-driven Social Robot Navigation metric to facilitate benchmarking and policy optimization. We motivate our approach and describe our proposal for storing rated social navigation trajectory datasets. Following these guidelines, we compiled a dataset with 4427 trajectories (182 real and 4245 simulated) and presented it to human raters, yielding a total of 4402 rated trajectories after data quality assurance. We also trained an RNN-based baseline metric on the dataset and present quantitative and qualitative results. All data, software, and model weights are publicly available.
Problem

Research questions and friction points this paper is trying to address.

Developing data-driven metrics for social robot navigation benchmarking
Creating rated social navigation trajectory datasets for evaluation
Training RNN-based baseline metrics using human-rated trajectory data
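
To make the idea of a rated trajectory record more concrete, here is a minimal sketch of how one such record could be stored and round-tripped through JSON. The field names (`trajectory_id`, `source`, `poses`, `rating`) and the 0-to-1 rating scale are assumptions made for illustration; they are not the storage format proposed in the paper. The counts are the ones stated in the abstract.

```python
import json
from dataclasses import dataclass, asdict
from typing import List, Tuple

# Counts taken from the abstract.
COMPILED = {"real": 182, "simulated": 4245}  # 4427 trajectories in total
RATED_AFTER_QA = 4402                        # kept after data quality assurance


@dataclass
class RatedTrajectory:
    """Hypothetical record layout; not the paper's actual schema."""
    trajectory_id: str
    source: str                               # "real" or "simulated"
    poses: List[Tuple[float, float, float]]   # assumed (x, y, heading) per step
    rating: float                             # assumed human score in [0, 1]

    def __post_init__(self) -> None:
        if self.source not in COMPILED:
            raise ValueError(f"unknown source: {self.source!r}")
        if not 0.0 <= self.rating <= 1.0:
            raise ValueError("rating must lie in [0, 1]")


def to_json(record: RatedTrajectory) -> str:
    """Serialize one record; tuples become JSON arrays."""
    return json.dumps(asdict(record))


def from_json(text: str) -> RatedTrajectory:
    """Rebuild a record, restoring poses as tuples."""
    data = json.loads(text)
    data["poses"] = [tuple(pose) for pose in data["poses"]]
    return RatedTrajectory(**data)


def removed_by_qa() -> int:
    """Number of compiled trajectories that did not survive quality assurance."""
    return sum(COMPILED.values()) - RATED_AFTER_QA
```

With the figures from the abstract, `removed_by_qa()` gives 25, i.e. roughly 0.6% of the 4427 compiled trajectories.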
Innovation

Methods, ideas, or system contributions that make the work stand out.

Data-driven social navigation metric development
RNN-based baseline metric training
Public dataset with rated trajectories
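
As an illustration of what an RNN-based metric does at inference time, the sketch below runs a small Elman-style recurrent cell over a trajectory and maps the final hidden state to a score in (0, 1). Everything here is an assumption for illustration: the class name `TinyRNNScorer`, the three features per time step, the hidden size, and the random untrained weights. The summary does not describe the architecture, input features, or training procedure of the paper's baseline, so this is not that model.

```python
import math
import random
from typing import List, Sequence


class TinyRNNScorer:
    """Elman-style RNN with a sigmoid read-out; illustrative and untrained."""

    def __init__(self, n_features: int, n_hidden: int = 8, seed: int = 0):
        rng = random.Random(seed)

        def matrix(rows: int, cols: int) -> List[List[float]]:
            return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

        self.n_features = n_features
        self.w_in = matrix(n_hidden, n_features)   # input-to-hidden weights
        self.w_rec = matrix(n_hidden, n_hidden)    # hidden-to-hidden weights
        self.b = [0.0] * n_hidden
        self.w_out = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]
        self.b_out = 0.0

    def score(self, steps: Sequence[Sequence[float]]) -> float:
        """Consume a variable-length trajectory and return a score in (0, 1)."""
        h = [0.0] * len(self.b)
        for x in steps:
            if len(x) != self.n_features:
                raise ValueError("unexpected number of features in a time step")
            # h_t = tanh(W_in x_t + W_rec h_{t-1} + b)
            h = [
                math.tanh(
                    sum(w * xi for w, xi in zip(self.w_in[j], x))
                    + sum(w * hk for w, hk in zip(self.w_rec[j], h))
                    + self.b[j]
                )
                for j in range(len(h))
            ]
        logit = sum(w * hj for w, hj in zip(self.w_out, h)) + self.b_out
        return 1.0 / (1.0 + math.exp(-logit))


def mean_squared_error(predicted: Sequence[float], rated: Sequence[float]) -> float:
    """One loss a supervised training loop could minimize against human ratings."""
    if len(predicted) != len(rated) or not predicted:
        raise ValueError("need two non-empty sequences of equal length")
    return sum((p - r) ** 2 for p, r in zip(predicted, rated)) / len(predicted)
```

A trained evaluator would fit the weights so that `score` tracks the human ratings; the sketch only shows how a recurrent model turns trajectories of different lengths into a single number that can be compared across navigation policies.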
Authors

Pilar Bachiller-Burgos
Universidad de Extremadura, Avenida de la Universidad, s/n, Cáceres 10003, Extremadura, Spain

Ulysses Bernardet
Aston University, Birmingham, UK
cybernetic human, virtual human, synthetic psychology, neurorobotics, HCI

Luis V. Calderita
Universidad de Extremadura, Avenida de la Universidad, s/n, Cáceres 10003, Extremadura, Spain

Pranup Chhetri
Aston University, Birmingham, United Kingdom

Anthony Francis
Logical Robotics, United States of America

Noriaki Hirose
University of California, Berkeley, Stanford University, Toyota Central R&D Labs., INC.
Machine Learning, Robotics, Motion Control

Noé Pérez
Universidad Pablo de Olavide, Spain

Dhruv Shah
Princeton University, Google DeepMind
Robot Learning, Artificial Intelligence, Robotics, Reinforcement Learning

Phani T. Singamaneni
LAAS-CNRS, France

Xuesu Xiao
George Mason University, United States of America

Luis J. Manso
Senior Lecturer (Associate Professor) in Computer Science, Aston University, UK
autonomous robotics, active perception, social navigation, human-robot interaction