FastStair: Learning to Run Up Stairs with Humanoid Robots

📅 2026-01-15
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of achieving both agility and stability in humanoid robots during high-speed stair climbing, where existing reinforcement learning approaches often yield unsafe behaviors while model-based planners tend to be overly conservative. The authors propose FastStair, a framework that integrates a dynamic foothold planner into the reinforcement learning pipeline. It employs multi-stage pretraining to establish a safe base policy and leverages LoRA to fuse speed-specialized expert policies, enabling adaptive control across the full velocity range. Evaluated on the Oli humanoid robot, the method achieves stable stair ascent at 1.65 m/s, successfully ascending a 33-step spiral staircase (17 cm per step) in under 12 seconds, and secured first place in the Canton Tower Robot Climbing Challenge, thereby breaking the speed barrier for humanoid stair running.

Technology Category

Application Category

📝 Abstract
Running up stairs is effortless for humans but remains extremely challenging for humanoid robots due to the simultaneous requirements of high agility and strict stability. Model-free reinforcement learning (RL) can generate dynamic locomotion, yet implicit stability rewards and heavy reliance on task-specific reward shaping tend to result in unsafe behaviors, especially on stairs; conversely, model-based foothold planners encode contact feasibility and stability structure, but enforcing their hard constraints often induces conservative motion that limits speed. We present FastStair, a planner-guided, multi-stage learning framework that reconciles these complementary strengths to achieve fast and stable stair ascent. FastStair integrates a parallel model-based foothold planner into the RL training loop to bias exploration toward dynamically feasible contacts and to pretrain a safety-focused base policy. To mitigate planner-induced conservatism and the discrepancy between low- and high-speed action distributions, the base policy was fine-tuned into speed-specialized experts and then integrated via Low-Rank Adaptation (LoRA) to enable smooth operation across the full commanded-speed range. We deploy the resulting controller on the Oli humanoid robot, achieving stable stair ascent at commanded speeds up to 1.65 m/s and traversing a 33-step spiral staircase (17 cm rise per step) in 12 s, demonstrating robust high-speed performance on long staircases. Notably, the proposed approach served as the champion solution in the Canton Tower Robot Run Up Competition.
Problem

Research questions and friction points this paper is trying to address.

humanoid robots
stair running
dynamic locomotion
stability
agility
Innovation

Methods, ideas, or system contributions that make the work stand out.

planner-guided reinforcement learning
multi-stage policy learning
Low-Rank Adaptation (LoRA)
dynamic stair climbing
humanoid locomotion
🔎 Similar Papers
No similar papers found.
Yan Liu
Yan Liu
Indiana University, California Institute of Technology, Washington University
Wavefront shapingAdaptive opticsOptical coherence tomographyOcular imagingPhotoacoustics
Tao Yu
Tao Yu
LimX Dynamics << Tencent
Reinforcement LearningComputer VisionVison-Language Models
H
Haolin Song
Department of Electronic Engineering and Information Science (EEIS), University of Science and Technology of China, Hefei 230027, China
Hongbo Zhu
Hongbo Zhu
The University of Manchester
XAIVLMDiffusion
N
Nianzong Hu
LimX Dynamics, Shenzhen, China
Y
Yuzhi Hao
LimX Dynamics, Shenzhen, China; Department of Mechanical Engineering, National University of Singapore, Singapore 117575
X
Xiuyong Yao
LimX Dynamics, Shenzhen, China
X
Xizhe Zang
School of Mechanics Engineering, Harbin Institute of Technology (HIT), Harbin Heilongjiang 150001, China
Hua Chen
Hua Chen
Assistant Professor, ZJU-UIUC Institute; Co-founder, LimX Dynamics
RoboticsEmbodied AIRobot LearningReinforcement LearningControl
J
Jie Zhao
School of Mechanics Engineering, Harbin Institute of Technology (HIT), Harbin Heilongjiang 150001, China