CLONE: Closed-Loop Whole-Body Humanoid Teleoperation for Long-Horizon Tasks

📅 2025-06-10
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Current full-body teleoperation systems suffer from two critical bottlenecks: (1) decoupled upper- and lower-limb control, severely degrading interlimb coordination; and (2) open-loop operation, causing cumulative global pose drift over time. To address these, we propose a novel closed-loop, full-body haptic teleoperation paradigm leveraging head-and-hand mixed-reality (MR) tracking and a Mixture-of-Experts (MoE)-driven control architecture. Our method employs 6-DOF head-mounted display and hand-tracking sensors for master input, integrated with real-time whole-body kinematic optimization, online error compensation, and closed-loop feedback control—enabling drift-free global localization and skill-coordinated motion using head-and-hand inputs only. Experiments on complex tasks such as “ground object retrieval” demonstrate a 92% reduction in positional drift and a threefold increase in single-session duration. This work establishes the first demonstration of high-fidelity, long-duration, naturally coordinated teleoperation for humanoid robots.

Technology Category

Application Category

📝 Abstract
Humanoid teleoperation plays a vital role in demonstrating and collecting data for complex humanoid-scene interactions. However, current teleoperation systems face critical limitations: they decouple upper- and lower-body control to maintain stability, restricting natural coordination, and operate open-loop without real-time position feedback, leading to accumulated drift. The fundamental challenge is achieving precise, coordinated whole-body teleoperation over extended durations while maintaining accurate global positioning. Here we show that an MoE-based teleoperation system, CLONE, with closed-loop error correction enables unprecedented whole-body teleoperation fidelity, maintaining minimal positional drift over long-range trajectories using only head and hand tracking from an MR headset. Unlike previous methods that either sacrifice coordination for stability or suffer from unbounded drift, CLONE learns diverse motion skills while preventing tracking error accumulation through real-time feedback, enabling complex coordinated movements such as ``picking up objects from the ground.'' These results establish a new milestone for whole-body humanoid teleoperation for long-horizon humanoid-scene interaction tasks.
Problem

Research questions and friction points this paper is trying to address.

Achieve precise whole-body humanoid teleoperation with coordination
Minimize positional drift in long-duration teleoperation tasks
Enable complex movements like picking objects with real-time feedback
Innovation

Methods, ideas, or system contributions that make the work stand out.

MoE-based system for whole-body teleoperation
Closed-loop error correction minimizes drift
Real-time feedback enables complex coordinated movements
🔎 Similar Papers
No similar papers found.
Y
Yixuan Li
School of Computer Science and Technology, Beijing Institute of Technology; State Key Laboratory of General Artificial Intelligence, BIGAI
Y
Yutang Lin
School of Psychological and Cognitive Sciences, Peking University; Institute for Artificial Intelligence, Peking University; Yuanpei College, Peking University; Beijing Key Laboratory of Behavior and Mental Health, Peking University
Jieming Cui
Jieming Cui
Peking University
Tengyu Liu
Tengyu Liu
Beijing Institute for General Artificial Intelligence
computer visionhuman object interactionhuman motion generationgrasping
W
Wei Liang
School of Computer Science and Technology, Beijing Institute of Technology
Yixin Zhu
Yixin Zhu
Assistant Professor, Peking University
Computer VisionVisual ReasoningHuman-Robot Teaming
S
Siyuan Huang
State Key Laboratory of General Artificial Intelligence, BIGAI; Joint Laboratory of Embodied AI and Humanoid Robots, BIGAI & UniTree Robotics