AgiBot World Colosseo: A Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

📅 2025-03-09
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address key challenges in real-world general-purpose robotic manipulation—namely limited data scale, poor generalization, and coarse-grained skill representations—this paper introduces AgiBot World, a large-scale robotic manipulation platform encompassing over one million trajectories, 217 diverse tasks, and five deployment scenarios. It supports fine-grained skill learning across grippers to dexterous hands and integrates tactile-visual multimodal sensing. The core contribution is Genie Operator-1 (GO-1), a universal policy leveraging implicit action representation for data-efficient learning. Complementary innovations include a standardized human-in-the-loop data collection pipeline, latent-space action modeling, and scalable hardware interfaces. We publicly release high-quality datasets and models to advance embodied intelligence democratization. Experiments demonstrate that the pre-trained GO-1 policy achieves a 30% average improvement over Open X-Embodiment baselines; on complex long-horizon tasks, it attains >60% success rate—surpassing RDT by 32%.

Technology Category

Application Category

📝 Abstract
We explore how scalable robot data can address real-world challenges for generalized robotic manipulation. Introducing AgiBot World, a large-scale platform comprising over 1 million trajectories across 217 tasks in five deployment scenarios, we achieve an order-of-magnitude increase in data scale compared to existing datasets. Accelerated by a standardized collection pipeline with human-in-the-loop verification, AgiBot World guarantees high-quality and diverse data distribution. It is extensible from grippers to dexterous hands and visuo-tactile sensors for fine-grained skill acquisition. Building on top of data, we introduce Genie Operator-1 (GO-1), a novel generalist policy that leverages latent action representations to maximize data utilization, demonstrating predictable performance scaling with increased data volume. Policies pre-trained on our dataset achieve an average performance improvement of 30% over those trained on Open X-Embodiment, both in in-domain and out-of-distribution scenarios. GO-1 exhibits exceptional capability in real-world dexterous and long-horizon tasks, achieving over 60% success rate on complex tasks and outperforming prior RDT approach by 32%. By open-sourcing the dataset, tools, and models, we aim to democratize access to large-scale, high-quality robot data, advancing the pursuit of scalable and general-purpose intelligence.
Problem

Research questions and friction points this paper is trying to address.

Addressing real-world robotic manipulation challenges using scalable robot data.
Developing a large-scale platform for diverse and high-quality robot data collection.
Creating a generalist policy for improved performance in dexterous and long-horizon tasks.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Large-scale platform with 1M+ trajectories
Standardized pipeline with human verification
Generalist policy maximizing data utilization
🔎 Similar Papers
No similar papers found.
A
AgiBot-World-Contributors
Qingwen Bu
Qingwen Bu
HKU | OpenDriveLab
Robot LearningComputer VisionMachine Learning
J
Jisong Cai
L
Li Chen
X
Xiuqi Cui
Y
Yan Ding
S
Siyuan Feng
Shenyuan Gao
Shenyuan Gao
Hong Kong University of Science and Technology
World ModelsRoboticsEmbodied AIGenerative AI
X
Xindong He
X
Xu Huang
S
Shu Jiang
Y
Yuxin Jiang
C
Cheng Jing
H
Hongyang Li
J
Jialu Li
C
Chiming Liu
Y
Yi Liu
Y
Yuxiang Lu
Jianlan Luo
Jianlan Luo
UC Berkeley, Google X
RoboticsMachine LearningArtificial Intelligence
Ping Luo
Ping Luo
National University of Defense Technology
distributed_computing
Y
Yao Mu
Y
Yuehan Niu
Y
Yixuan Pan
J
Jiangmiao Pang
Y
Yu Qiao
G
Guanghui Ren
C
Cheng Ruan
J
Jiaqi Shan
Y
Yongjian Shen
C
Chengshi Shi
M
Mingkang Shi
Modi Shi
Modi Shi
Beihang University
embodied ai
Chonghao Sima
Chonghao Sima
HKU | OpenDriveLab
computer visionautonomous driving
J
Jianheng Song
Huijie Wang
Huijie Wang
OpenDriveLab
Embodied AIScene Understanding
W
Wenhao Wang
Dafeng Wei
Dafeng Wei
Shanghai Jiao Tong University
AILLMCVVLM
C
Chengen Xie
G
Guo Xu
Junchi Yan
Junchi Yan
FIAPR & ICML Board Member, SJTU (2018-), SII (2024-), AWS (2019-2022), IBM (2011-2018)
Computational IntelligenceAI4ScienceMachine LearningAutonomous Driving
C
Cunbiao Yang
L
Lei Yang
S
Shukai Yang
Maoqing Yao
Maoqing Yao
Google
J
Jia Zeng
C
Chi Zhang
Q
Qinglin Zhang
B
Bin Zhao
C
Chengyue Zhao
Jiaqi Zhao
Jiaqi Zhao
Xidian University
privacy-preserving machine learning
J
Jianchao Zhu