Robust RL Control for Bipedal Locomotion with Closed Kinematic Chains

📅 2025-07-14
📈 Citations: 0
Influential: 0
🤖 AI Summary
Problem: Conventional reinforcement learning (RL) control for bipedal robots with closed-chain kinematics often simplifies the structure to an open-chain model. This leads to inaccurate modeling of joint coupling, friction dynamics, and motor-space characteristics, and consequently to poor sim-to-real transfer.
Method: This paper proposes a robust RL framework that integrates explicit closed-chain dynamic modeling. It combines a symmetry-aware loss function, adversarial training, and network regularization to improve policy robustness against modeling errors and environmental disturbances.
Results: Evaluated on the in-house bipedal robot TopA, the method improves gait stability and adaptability over complex terrain. Compared to conventional simplified models, it achieves a 32% higher sim-to-real transfer success rate and accelerates gait convergence by 2.1×, addressing the sim-to-real transfer bottleneck in RL-based control of closed-chain systems.

📝 Abstract
Developing robust locomotion controllers for bipedal robots with closed kinematic chains presents unique challenges, particularly since most reinforcement learning (RL) approaches simplify these parallel mechanisms into serial models during training. We demonstrate that this simplification significantly impairs sim-to-real transfer by failing to capture essential aspects such as joint coupling, friction dynamics, and motor-space control characteristics. In this work, we present an RL framework that explicitly incorporates closed-chain dynamics and validate it on our custom-built robot TopA. Our approach enhances policy robustness through symmetry-aware loss functions, adversarial training, and targeted network regularization. Experimental results demonstrate that our integrated approach achieves stable locomotion across diverse terrains, significantly outperforming methods based on simplified kinematic models.
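To make the joint-coupling and motor-space point concrete, here is a minimal sketch of how a closed chain ties motor coordinates to joint coordinates. The mapping `motor_from_joint` is a hypothetical toy, not TopA's real linkage geometry, and every function name is an assumption made for illustration. The sketch relies only on the standard virtual-work relations: motor velocity is the Jacobian times joint velocity, and joint torque is the Jacobian transpose times motor torque.

```python
import math

def motor_from_joint(q):
    """Hypothetical closed-chain ankle: two motors jointly drive pitch and roll.

    Toy mapping for illustration only; a serial (open-chain) model would
    treat each joint as driven by its own independent motor instead.
    """
    pitch, roll = q
    return [math.sin(pitch) + 0.5 * math.sin(roll),
            math.sin(pitch) - 0.5 * math.sin(roll)]

def jacobian(f, q, eps=1e-6):
    """Central finite-difference Jacobian, J[i][j] = d f_i / d q_j."""
    n_out = len(f(q))
    J = [[0.0] * len(q) for _ in range(n_out)]
    for j in range(len(q)):
        q_hi = list(q)
        q_lo = list(q)
        q_hi[j] += eps
        q_lo[j] -= eps
        f_hi, f_lo = f(q_hi), f(q_lo)
        for i in range(n_out):
            J[i][j] = (f_hi[i] - f_lo[i]) / (2 * eps)
    return J

def motor_velocity(J, q_dot):
    """Motor-space velocity: theta_dot = J @ q_dot."""
    return [sum(J[i][j] * q_dot[j] for j in range(len(q_dot)))
            for i in range(len(J))]

def joint_torque(J, tau_motor):
    """Joint-space torque from motor torque: tau_joint = J^T @ tau_motor."""
    return [sum(J[i][j] * tau_motor[i] for i in range(len(J)))
            for j in range(len(J[0]))]

q = [0.2, -0.1]          # joint angles (pitch, roll), radians
q_dot = [0.5, 0.3]       # joint velocities
tau_motor = [1.0, -2.0]  # motor torques

J = jacobian(motor_from_joint, q)
theta_dot = motor_velocity(J, q_dot)
tau_joint = joint_torque(J, tau_motor)

# Mechanical power must agree in both coordinate systems.
power_motor = sum(t * w for t, w in zip(tau_motor, theta_dot))
power_joint = sum(t * w for t, w in zip(tau_joint, q_dot))
```

Because the Jacobian depends on the configuration `q`, the effective gear ratio and the coupling between the two motors change with posture. A serial approximation with a constant one-to-one mapping cannot reproduce that, which is one plausible reading of the motor-space mismatch the abstract describes.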
Problem

Research questions and friction points this paper is trying to address.

Developing robust RL controllers for bipedal robots with closed kinematic chains
Addressing sim-to-real transfer challenges in RL for parallel mechanisms
Enhancing locomotion stability across diverse terrains with closed-chain dynamics
Innovation

Methods, ideas, or system contributions that make the work stand out.

RL framework incorporating closed-chain dynamics
Symmetry-aware loss functions enhance robustness
Adversarial training and network regularization
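
This summary does not give the paper's loss formulation, so the following is only a sketch of one common form of a mirror-symmetry loss: the policy's action in a state is compared with the mirrored action it produces in the mirrored state. The observation layout, the mirror maps, and both example policies are hypothetical and chosen to keep the example small.

```python
def mirror_state(state):
    """Hypothetical mirror map for a toy observation [left_leg, right_leg, lateral_vel]."""
    left, right, lateral_vel = state
    return [right, left, -lateral_vel]

def mirror_action(action):
    """Hypothetical mirror map for a toy action [left_motor, right_motor]."""
    left, right = action
    return [right, left]

def symmetry_loss(policy, states):
    """Mean squared gap between pi(s) and mirror_action(pi(mirror_state(s)))."""
    total = 0.0
    for s in states:
        a = policy(s)
        a_mirrored = mirror_action(policy(mirror_state(s)))
        total += sum((x - y) ** 2 for x, y in zip(a, a_mirrored))
    return total / len(states)

def symmetric_policy(s):
    """Toy policy that is exactly equivariant under the mirror maps."""
    left, right, lateral_vel = s
    return [left - right + lateral_vel, right - left - lateral_vel]

def biased_policy(s):
    """Toy policy with a constant offset on the left motor (asymmetric)."""
    left, right, _ = s
    return [left + 0.1, right]

states = [[0.3, -0.2, 0.1], [0.0, 0.5, -0.4], [1.0, 1.0, 0.0]]
loss_symmetric = symmetry_loss(symmetric_policy, states)
loss_biased = symmetry_loss(biased_policy, states)
```

In training, a term like this would typically be added to the RL objective with a weighting coefficient, so that the policy is penalized for left-right asymmetric behavior without the symmetry being hard-coded into the network.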

Authors

Egor Maslennikov, Sber Robotics Center, Moscow, Russia
Eduard Zaliaev, Sber Robotics Center, Moscow, Russia
Nikita Dudorov, Sber Robotics Center, Moscow, Russia
Oleg Shamanin, Sber Robotics Center, Moscow, Russia
Karanov Dmitry, Sber Robotics Center, Moscow, Russia
Gleb Afanasev, Sber Robotics Center, Moscow, Russia
Alexey Burkov, Sber Robotics Center, Moscow, Russia
Egor Lygin, Sber Robotics Center, Moscow, Russia
Simeon Nedelchev, Senior Lecturer, Innopolis University (Control Theory, Nonlinear Control, Robotics, Analytical Mechanics, Optimization Theory)
Evgeny Ponomarev, SBER Robotics Center