🤖 AI Summary
ALOHA-based bimanual teleoperation robots exhibit insufficient performance, poor robustness, and suboptimal human factors in kitchen environments. Method: This work proposes a systematic enhancement built upon the ALOHA2 architecture, integrating dual ViperX 6-DoF master arms with a WidowX auxiliary arm/gripper; incorporating backdrivable actuation, gravity compensation mechanisms, and multi-view RGB visual feedback; and optimizing teleoperation mapping and data acquisition for improved motion fidelity and real-time responsiveness. Contribution/Results: Experiments demonstrate significantly higher success rates and stability in complex kitchen tasks—including object retrieval/placement, door/ drawer manipulation, and pouring—while enabling high-fidelity motion capture and synchronized multimodal data logging. The system establishes a scalable, high-quality dataset foundation for downstream imitation learning and embodied policy training.
📝 Abstract
ALOHA2 is an enhanced version of the dual-arm teleoperated robot ALOHA, featuring higher performance and robustness compared to the original design, while also being more ergonomic. Like ALOHA, ALOHA2 consists of two grippers and two ViperX 6-DoF arms, as well as two smaller WidowX arms. Users control the follower mechanical arms by operating the leader mechanical arms through back-driving. The device also includes cameras that generate images from multiple viewpoints, allowing for RGB data collection during teleoperation. The robot is mounted on a 48-inch x 30-inch table, equipped with an aluminum frame that provides additional mounting points for cameras and gravity compensation systems.