RL-Augmented MPC for Non-Gaited Legged and Hybrid Locomotion

📅 2026-03-11
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of locomotion control for legged and wheeled-legged hybrid robots in unstructured environments, where complex contact sequencing complicates motion planning. The authors propose an explicit contact-aware hierarchical architecture that decouples contact timing planning from execution: a high-level reinforcement learning agent generates gait and navigation commands, while a low-level model predictive controller carries out the motion. Notably, this approach achieves zero-shot sim-to-real transfer without domain randomization and demonstrates emergent capabilities—including aperiodic gait generation and adaptive contact timing—across multiple robotic platforms weighing 50–120 kg. Experimental validation on the Centauro wheeled-legged humanoid confirms the method’s effectiveness, and the associated software framework along with evaluation results has been publicly released.

Technology Category

Application Category

📝 Abstract
We propose a contact-explicit hierarchical architecture coupling Reinforcement Learning (RL) and Model Predictive Control (MPC), where a high-level RL agent provides gait and navigation commands to a low-level locomotion MPC. This offloads the combinatorial burden of contact timing from the MPC by learning acyclic gaits through trial and error in simulation. We show that only a minimal set of rewards and limited tuning are required to obtain effective policies. We validate the architecture in simulation across robotic platforms spanning 50 kg to 120 kg and different MPC implementations, observing the emergence of acyclic gaits and timing adaptations in flat-terrain legged and hybrid locomotion, and further demonstrating extensibility to non-flat terrains. Across all platforms, we achieve zero-shot sim-to-sim transfer without domain randomization, and we further demonstrate zero-shot sim-to-real transfer without domain randomization on Centauro, our 120 kg wheeled-legged humanoid robot. We make our software framework and evaluation results publicly available at https://github.com/AndrePatri/AugMPC.
Problem

Research questions and friction points this paper is trying to address.

non-gaited locomotion
contact timing
legged robots
hybrid locomotion
combinatorial complexity
Innovation

Methods, ideas, or system contributions that make the work stand out.

Reinforcement Learning
Model Predictive Control
Non-gaited Locomotion
Zero-shot Sim-to-Real Transfer
Contact-explicit Hierarchical Control
🔎 Similar Papers
No similar papers found.
A
Andrea Patrizi
Humanoids and Human-Centred Mechatronics (HHCM) lab, Istituto Italiano di Tecnologia (IIT), Via San Quirico 19d, 16163 Genova; and Department of Informatics, Bioengineering, Robotics and Systems Engineering, University of Genova, Via All’Opera Pia 13, 16145 Genova
C
Carlo Rizzardo
Humanoids and Human-Centred Mechatronics (HHCM) lab, Istituto Italiano di Tecnologia (IIT), Via San Quirico 19d, 16163 Genova
Arturo Laurenzi
Arturo Laurenzi
Istituto Italiano di Tecnologia (IIT)
roboticslocomotionmodel predictive controlstate estimationsoftware architecture
Francesco Ruscelli
Francesco Ruscelli
Istituto Italiano di Tecnologia
Robotics
Luca Rossini
Luca Rossini
Associate Professor in Statistics - University of Milan
Bayesian nonparametricsEconometricsEnergyForecastingCopula Models
N
Nikos G. Tsagarakis
Humanoids and Human-Centred Mechatronics (HHCM) lab, Istituto Italiano di Tecnologia (IIT), Via San Quirico 19d, 16163 Genova