🤖 AI Summary
To address the scarcity of high-quality, large-scale manipulation data for bimanual robotic systems—stemming from hardware heterogeneity and hindering dexterous manipulation research—this paper introduces the largest open-source bimanual robotic manipulation dataset to date. It encompasses 15 heterogeneous robot platforms, 16 operational scenarios, and 421 distinct tasks, with over 180,000 expert demonstrations. We propose a hierarchical capability pyramid annotation framework enabling multi-granularity semantic labeling at trajectory-, segment-, and frame-levels. Additionally, we design the Robot Trajectory Markup Language (RTML) and the unified management framework CoRobot, which support standardized parsing, automated annotation generation, and quality assessment of heterogeneous data. Experimental results demonstrate that our dataset significantly improves performance across diverse models on cross-platform bimanual manipulation tasks, exhibiting strong generalization and reliability.
📝 Abstract
Bimanual manipulation is essential for achieving human-like dexterity in robots, but the large-scale and diverse bimanual robot datasets remain scarce due to hardware heterogeneity across robotic platforms. To address the challenge, we present RoboCOIN, a comprehensive multi-embodiment bimanual manipulation dataset with over 180,000 demonstrations collected from 15 distinct robotic platforms. The dataset covers 16 scenarios, including residential, commercial, and working environments, with 421 tasks systematically organized by bimanual coordination patterns and object properties. Our key innovation is a hierarchical capability pyramid that provides multi-level annotations, spanning trajectory-level concepts, segment-level subtasks, and frame-level kinematics. We further develop CoRobot, a comprehensive processing framework featuring Robot Trajectory Markup Language (RTML) for quality assessment, automated annotation generation, and unified multi-embodiment management. Extensive experiments demonstrate the reliability and effectiveness of RoboCOIN in multi-embodiment bimanual learning, with significant performance improvements across various model architectures and robotic platforms. The complete dataset and framework are open-sourced and publicly available for further research purposes. Project website: https://FlagOpen.github.io/RoboCOIN/.