Open X-Embodiment: Robotic Learning Datasets and RT-X Models

📅 2023-10-13
🏛️ arXiv.org
📈 Citations: 451
Influential: 48
🤖 AI Summary
Robot learning is fragmented: models are conventionally trained separately for each robot platform, task, and environment. Method: This paper studies cross-embodiment ("X-robot") learning as a path to universal policies. The authors assemble the largest standardized robotic manipulation dataset to date, built collaboratively across many institutions, and propose RT-X, a Transformer-based architecture that unifies action representations and data formats across robots, trained via large-scale multi-robot behavioral cloning and transfer learning. Contribution/Results: RT-X exhibits positive transfer across 22 heterogeneous robot platforms, with substantially higher task success rates than policies trained on single-robot data. This work offers among the first systematic empirical evidence for the effectiveness and scalability of cross-robot positive transfer, a foundational step toward generalizable robotic policies.
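A key ingredient the summary mentions is unifying action representations across robots. RT-X coarsely aligns each robot's native action space to a shared 7-dimensional end-effector format (position deltas, rotation deltas, gripper command). The sketch below illustrates the idea only; the field names and defaults are hypothetical, not the dataset's actual schema.

```python
import numpy as np

def unify_action(robot_action: dict) -> np.ndarray:
    """Map a robot-specific action dict into a shared 7-D format:
    3 end-effector position deltas, 3 rotation deltas, 1 gripper command.
    Keys here ("delta_xyz", "delta_rpy", "gripper") are illustrative."""
    return np.concatenate([
        np.asarray(robot_action.get("delta_xyz", np.zeros(3)), dtype=np.float32),
        np.asarray(robot_action.get("delta_rpy", np.zeros(3)), dtype=np.float32),
        np.asarray([robot_action.get("gripper", 0.0)], dtype=np.float32),
    ])

# Two robots with different native action spaces land in the same format,
# so their demonstrations can be pooled into one training set.
a1 = unify_action({"delta_xyz": [0.01, 0.0, -0.02],
                   "delta_rpy": [0.0, 0.0, 0.1],
                   "gripper": 1.0})
a2 = unify_action({"delta_xyz": [0.0, 0.03, 0.0], "gripper": 0.0})
```

Both `a1` and `a2` are 7-vectors, regardless of which fields the source robot provided.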
📝 Abstract
Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning methods train a separate model for every application, every robot, and even every environment. Can we instead train a generalist X-robot policy that can be adapted efficiently to new robots, tasks, and environments? In this paper, we provide datasets in standardized data formats and models to make it possible to explore this possibility in the context of robotic manipulation, alongside experimental results that provide an example of effective X-robot policies. We assemble a dataset from 22 different robots collected through a collaboration between 21 institutions, demonstrating 527 skills (160266 tasks). We show that a high-capacity model trained on this data, which we call RT-X, exhibits positive transfer and improves the capabilities of multiple robots by leveraging experience from other platforms. More details can be found on the project website https://robotics-transformer-x.github.io.
Problem

Research questions and friction points this paper is trying to address.

Can generalist X-robot policies replace application-specific robotic models?
How can models be trained to adapt efficiently to diverse robots, tasks, and environments?
Does large-scale multi-robot data enable effective positive transfer?
Innovation

Methods, ideas, or system contributions that make the work stand out.

Standardized datasets for diverse robotic learning
Generalist X-robot policy for multiple applications
High-capacity RT-X model enabling positive transfer
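Once actions share a common format, the training recipe named above (multi-robot behavioral cloning) reduces to supervised regression on pooled demonstrations. The toy below sketches that idea with a linear stand-in for the policy and a synthetic expert; it is a minimal illustration, not the RT-X training pipeline, and all dimensions and names are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical expert: demonstrations from different robots are assumed
# already mapped into one unified 7-D action space, so they pool cleanly.
W_true = (rng.normal(size=(16, 7)) * 0.1).astype(np.float32)

def sample_batch(batch: int = 32):
    """Draw a mixed batch of (observation, expert action) pairs."""
    obs = rng.normal(size=(batch, 16)).astype(np.float32)
    act = obs @ W_true  # expert actions in the unified format
    return obs, act

W = np.zeros((16, 7), dtype=np.float32)  # linear stand-in for the policy
lr = 0.05
for _ in range(500):
    obs, act = sample_batch()
    pred = obs @ W
    # Gradient of the mean-squared behavioral-cloning loss.
    W -= lr * (obs.T @ (pred - act)) / len(obs)

obs, act = sample_batch()
mse = float(np.mean((obs @ W - act) ** 2))
```

After training, `mse` is close to zero: the cloned policy reproduces the pooled expert, which is the mechanism positive transfer builds on when the pooled data spans many robots.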
👥 Authors
Abhishek Padalkar, Acorn Pooley, Ajinkya Jain, Alex Bewley, Alex Herzog, A. Irpan, Alexander Khazatsky, Anant Rai, Anikait Singh, Anthony Brohan, Antonin Raffin, Ayzaan Wahid, Ben Burgess-Limerick, Beomjoon Kim, Bernhard Schölkopf, Brian Ichter, Cewu Lu, Charles Xu, Chelsea Finn, Chenfeng Xu, Cheng Chi, Chenguang Huang, Christine Chan, Chuer Pan, Chuyuan Fu, Coline Devin, Danny Driess, Deepak Pathak, Dhruv Shah, Dieter Büchler, Dmitry Kalashnikov, Dorsa Sadigh, Edward Johns, Federico Ceola, Fei Xia, Gaoyue Zhou, Gaurav S. Sukhatme, Gautam Salhotra, Ge Yan, Giulio Schiavi, Hao Su, Haoshu Fang, Haochen Shi, Henrik I Christensen, Hiroki Furuta, Igor Mordatch, Ilija Radosavovic, Isabel Leal, Jacky Liang, Jaehyung Kim, Jan Schneider, Jasmine Hsu, Jeannette Bohg, Jiajun Wu, Jialin Wu, Jianlan Luo, Jiayuan Gu, Jie Tan, Jihoon Oh, Joseph J. Lim, João Silvério, Junhyek Han, Karl Pertsch, Karol Hausman, Keegan Go, Ken Goldberg, Kendra Byrne, Kenneth Oslund, Kento Kawaharazuka, Kevin Zhang, Krishan Rana, Sergey Levine