RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning

📅 2025-04-26
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Robot learning is hindered by high real-world data acquisition costs and inconsistent evaluation protocols. To address these challenges, we propose MetaSim: a unified simulation abstraction layer supporting multiple physics engines (e.g., PyBullet, Isaac Gym); a hierarchical generalization benchmark spanning imitation and reinforcement learning, the first to jointly model physical fidelity and task diversity in synthetic data; and an integrated pipeline combining procedural scene generation, neural rendering, and physics-driven control to produce high-fidelity synthetic datasets and a world model training framework. Experiments demonstrate an average 12.7% performance improvement on sim-to-real transfer tasks. The proposed benchmark has been adopted as a standard evaluation protocol by three leading robotics laboratories.

Technology Category

Application Category

📝 Abstract
Data scaling and standardized evaluation benchmarks have driven significant advances in natural language processing and computer vision. However, robotics faces unique challenges in scaling data and establishing evaluation protocols. Collecting real-world data is resource-intensive and inefficient, while benchmarking in real-world scenarios remains highly complex. Synthetic data and simulation offer promising alternatives, yet existing efforts often fall short in data quality, diversity, and benchmark standardization. To address these challenges, we introduce RoboVerse, a comprehensive framework comprising a simulation platform, a synthetic dataset, and unified benchmarks. Our simulation platform supports multiple simulators and robotic embodiments, enabling seamless transitions between different environments. The synthetic dataset, featuring high-fidelity physics and photorealistic rendering, is constructed through multiple approaches. Additionally, we propose unified benchmarks for imitation learning and reinforcement learning, enabling evaluation across different levels of generalization. At the core of the simulation platform is MetaSim, an infrastructure that abstracts diverse simulation environments into a universal interface. It restructures existing simulation environments into a simulator-agnostic configuration system, as well as an API aligning different simulator functionalities, such as launching simulation environments, loading assets with initial states, stepping the physics engine, etc. This abstraction ensures interoperability and extensibility. Comprehensive experiments demonstrate that RoboVerse enhances the performance of imitation learning, reinforcement learning, world model learning, and sim-to-real transfer. These results validate the reliability of our dataset and benchmarks, establishing RoboVerse as a robust solution for advancing robot learning.
Problem

Research questions and friction points this paper is trying to address.

Addresses challenges in scaling robot learning data and evaluation.
Proposes a unified platform for synthetic data and simulation.
Introduces standardized benchmarks for generalization in robot learning.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Simulation platform with multiple simulators support
High-fidelity synthetic dataset with diverse approaches
Unified benchmarks for imitation and reinforcement learning
🔎 Similar Papers
Haoran Geng
Haoran Geng
PhD Student, UC Berkeley
RoboticsComputer VisionReinforcement Learning
Feishi Wang
Feishi Wang
Graduate, Peking University
Computer VisionRoboticsComputational Photography
Songlin Wei
Songlin Wei
University of Southern California, (Previously) Peking University
Robotics3D Vision
Yuyang Li
Yuyang Li
Institute for AI, Peking University
Robotic ManipulationTactile SensingHuman-Object Interaction
Bangjun Wang
Bangjun Wang
University of Hong Kong
Artificial IntelligenceRobot LearningComputer Vision
Boshi An
Boshi An
Peking University
RoboticsComputational Neuroscience
C
Charlie Tianyue Cheng
UC Berkeley
Haozhe Lou
Haozhe Lou
University of Southern California
RoboticControl systemPhysics informed neural networkDifferentiable Simulator
Peihao Li
Peihao Li
Tsinghua University; Tongyi Lab, Alibaba
3D reconstructiongenerative modelhuman avatar
Yen-Jen Wang
Yen-Jen Wang
UC Berkeley
Robotics
Y
Yutong Liang
PKU
D
Dylan Goetting
UC Berkeley
Chaoyi Xu
Chaoyi Xu
Unknown affiliation
Haozhe Chen
Haozhe Chen
Princeton University
representation learningfoundation modelsreinforcement learning
Y
Yuxi Qian
Stanford
Yiran Geng
Yiran Geng
PKU; MIT
Reinforcement LearningRoboticsComputer Vision
Jiageng Mao
Jiageng Mao
University of Southern California
RoboticsComputer Vision
Weikang Wan
Weikang Wan
PhD Student, University of California, San Diego
Mingtong Zhang
Mingtong Zhang
University of Southern California
Computer VisionRoboticsRobot Learning
J
Jiangran Lyu
PKU
Siheng Zhao
Siheng Zhao
University of Southern California
RoboticsReinforcement LearningMachine Learning
Jiazhao Zhang
Jiazhao Zhang
Peking University
Embodied AINavigation3D Vision
J
Jialiang Zhang
UC Berkeley, PKU
Chengyang Zhao
Chengyang Zhao
Carnegie Mellon University
RoboticsMachine Learning3D Computer Vision
H
Haoran Lu
PKU
Y
Yufei Ding
UC Berkeley, PKU
R
Ran Gong
UCLA
Y
Yuran Wang
PKU
Yuxuan Kuang
Yuxuan Kuang
Carnegie Mellon University
Robotics3D Computer VisionMachine Learning
Ruihai Wu
Ruihai Wu
Peking University
computer visionrobotics
Baoxiong Jia
Baoxiong Jia
Ph.D. in Computer Science, UCLA
Computer VisionArtificial Intelligence
C
Carlo Sferrazza
UC Berkeley
H
Hao Dong
PKU
S
Siyuan Huang
BIGAI
Y
Yue Wang
USC
J
Jitendra Malik
UC Berkeley
Pieter Abbeel
Pieter Abbeel
UC Berkeley | Covariant
RoboticsMachine LearningAI