KungfuBot: Physics-Based Humanoid Whole-Body Control for Learning Highly-Dynamic Skills

📅 2025-06-15
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing imitation learning methods struggle to replicate highly dynamic human skills—such as martial arts or dance—due to their reliance on smooth, low-velocity motion assumptions, rendering them inadequate for handling strong impacts and rapid directional changes. To address this, we propose a physics-constrained whole-body control framework. First, we design a physics-prioritized motion retargeting pipeline that safely maps human motions onto robot configurations. Second, we introduce a two-tier optimization mechanism: an upper layer dynamically adjusts tracking tolerance based on real-time error (adaptive curriculum), while a novel asymmetric Actor-Critic architecture in the lower layer enables high-dynamics policy training, integrated with adaptive reward shaping and real-time whole-body dynamics control. Evaluated on the Unitree G1 humanoid platform, our method significantly reduces tracking error and robustly reproduces complex, high-dynamic motions—outperforming state-of-the-art approaches.

Technology Category

Application Category

📝 Abstract
Humanoid robots are promising to acquire various skills by imitating human behaviors. However, existing algorithms are only capable of tracking smooth, low-speed human motions, even with delicate reward and curriculum design. This paper presents a physics-based humanoid control framework, aiming to master highly-dynamic human behaviors such as Kungfu and dancing through multi-steps motion processing and adaptive motion tracking. For motion processing, we design a pipeline to extract, filter out, correct, and retarget motions, while ensuring compliance with physical constraints to the maximum extent. For motion imitation, we formulate a bi-level optimization problem to dynamically adjust the tracking accuracy tolerance based on the current tracking error, creating an adaptive curriculum mechanism. We further construct an asymmetric actor-critic framework for policy training. In experiments, we train whole-body control policies to imitate a set of highly-dynamic motions. Our method achieves significantly lower tracking errors than existing approaches and is successfully deployed on the Unitree G1 robot, demonstrating stable and expressive behaviors. The project page is https://kungfu-bot.github.io.
Problem

Research questions and friction points this paper is trying to address.

Enable humanoid robots to perform highly-dynamic human motions
Develop physics-based control for complex skills like Kungfu
Improve motion tracking accuracy for expressive robot behaviors
Innovation

Methods, ideas, or system contributions that make the work stand out.

Physics-based humanoid control for dynamic skills
Multi-step motion processing with physical constraints
Bi-level optimization for adaptive motion tracking
🔎 Similar Papers
No similar papers found.
W
Weiji Xie
Institute of Artificial Intelligence (TeleAI), China Telecom; Shanghai Jiao Tong University
J
Jinrui Han
Institute of Artificial Intelligence (TeleAI), China Telecom; Shanghai Jiao Tong University
Jiakun Zheng
Jiakun Zheng
The Hong Kong University of Science and Technology
Computing-in-MemoryAI Accelerator
H
Huanyu Li
Institute of Artificial Intelligence (TeleAI), China Telecom; Harbin Institute of Technology
Xinzhe Liu
Xinzhe Liu
Shanghaitech University
Robotics
Jiyuan Shi
Jiyuan Shi
Tsinghua University
Reinforcement LearningRobotics
W
Weinan Zhang
Shanghai Jiao Tong University
Chenjia Bai
Chenjia Bai
Institute of Artificial Intelligence, China Telecom(中国电信人工智能研究院, TeleAI)
Reinforcement LearningRoboticsEmbodied AI
X
Xuelong Li
Institute of Artificial Intelligence (TeleAI), China Telecom