Unleashing Humanoid Reaching Potential via Real-world-Ready Skill Space

📅 2025-05-16
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses core challenges in achieving robust, generalizable goal-reaching for humanoid robots in large 3D environments: tight coupling among base localization/reorientation, whole-body pose regulation, and end-effector pose control; slow zero-shot training convergence; and poor sim-to-real transfer. We propose the Real-World-Ready Skill Space (R2S2), which embeds empirically validated atomic skills into a structured implicit skill space to form a unified, prior-enriched representation. R2S2 integrates a sampling-based high-level planner with a joint reinforcement learning–motion control optimization framework to enable end-to-end goal-reaching. Experiments demonstrate zero-shot sim-to-real transfer across complex scenarios—including multi-height terrains, long-distance navigation, and dynamic obstacles—while significantly improving task success rate, generalization, and robustness.

Technology Category

Application Category

📝 Abstract
Humans possess a large reachable space in the 3D world, enabling interaction with objects at varying heights and distances. However, realizing such large-space reaching on humanoids is a complex whole-body control problem and requires the robot to master diverse skills simultaneously-including base positioning and reorientation, height and body posture adjustments, and end-effector pose control. Learning from scratch often leads to optimization difficulty and poor sim2real transferability. To address this challenge, we propose Real-world-Ready Skill Space (R2S2). Our approach begins with a carefully designed skill library consisting of real-world-ready primitive skills. We ensure optimal performance and robust sim2real transfer through individual skill tuning and sim2real evaluation. These skills are then ensembled into a unified latent space, serving as a structured prior that helps task execution in an efficient and sim2real transferable manner. A high-level planner, trained to sample skills from this space, enables the robot to accomplish real-world goal-reaching tasks. We demonstrate zero-shot sim2real transfer and validate R2S2 in multiple challenging goal-reaching scenarios.
Problem

Research questions and friction points this paper is trying to address.

Achieving large-space reaching on humanoids via whole-body control
Overcoming optimization and sim2real transfer challenges in skill learning
Enabling real-world goal-reaching with a unified skill space
Innovation

Methods, ideas, or system contributions that make the work stand out.

Real-world-ready primitive skill library
Unified latent space for skill ensembling
High-level planner for skill sampling
🔎 Similar Papers
No similar papers found.
Zhikai Zhang
Zhikai Zhang
Tsinghua University
C
Chao Chen
Galbot
H
Han Xue
Tsinghua University, Galbot
J
Jilong Wang
Peking University, Galbot
S
Sikai Liang
Galbot, Tongji University
Zongzhang Zhang
Zongzhang Zhang
Nanjing University
Artificial IntelligenceReinforcement LearningProbabilistic PlanningMulti-Agent Systems
H
He Wang
Peking University, Galbot
L
Li Yi
Tsinghua University, Shanghai AI Laboratory, Shanghai Qi Zhi Institute