McARL: Morphology-Control-Aware Reinforcement Learning for Generalizable Quadrupedal Locomotion

📅 2025-05-23

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

Existing reinforcement learning approaches for quadrupedal robots generalize poorly across morphologically distinct platforms: transferring a policy typically requires hyperparameter re-tuning and still degrades performance. To address this, we propose Morphology-Control-Aware Reinforcement Learning (McARL), a framework that feeds a randomly sampled morphology vector into both the policy (actor) and value (critic) networks, yielding a morphology-conditioned control policy. The work also analyzes how morphology distance affects transfer performance. Built on PPO, McARL is validated across multiple simulation environments and real-world platforms (Go1, Go2, A1, Mini Cheetah). A single policy trained on Go1 reaches 6.0 m/s on Go1 and transfers zero-shot to Go2 at up to 3.5 m/s. Transfer performance on Go2, Mini Cheetah, and A1 is 44–150% higher than that of PPO variants.

📝 Abstract

We present Morphology-Control-Aware Reinforcement Learning (McARL), a new approach to overcome challenges of hyperparameter tuning and transfer loss, enabling generalizable locomotion across robot morphologies. We use a morphology-conditioned policy by incorporating a randomized morphology vector, sampled from a defined morphology range, into both the actor and critic networks. This allows the policy to learn parameters that generalize to robots with similar characteristics. We demonstrate that a single policy trained on a Unitree Go1 robot using McARL can be transferred to a different morphology (e.g., Unitree Go2 robot) and can achieve zero-shot transfer velocity of up to 3.5 m/s without retraining or fine-tuning. Moreover, it achieves 6.0 m/s on the training Go1 robot and generalizes to other morphologies like A1 and Mini Cheetah. We also analyze the impact of morphology distance on transfer performance and highlight McARL's advantages over prior approaches. McARL achieves 44-150% higher transfer performance on Go2, Mini Cheetah, and A1 compared to PPO variants.
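The conditioning idea in the abstract can be illustrated with a minimal sketch: a morphology vector is drawn from a fixed range and concatenated with the observation before it enters both the actor and the critic. Everything specific here is an assumption made for illustration, not taken from the paper: the parameter names and ranges in `MORPH_RANGES`, the uniform sampling, the scaling to [-1, 1], the tiny tanh networks, and the dimensions. The PPO update itself is omitted.

```python
import math
import random

# Hypothetical morphology parameters and ranges (illustrative, not from the paper).
MORPH_RANGES = {
    "trunk_mass_kg": (4.0, 16.0),
    "thigh_length_m": (0.15, 0.30),
    "calf_length_m": (0.15, 0.30),
}


def sample_morphology(rng):
    """Draw one morphology vector uniformly from the ranges, scaled to [-1, 1]."""
    vec = []
    for lo, hi in MORPH_RANGES.values():
        value = rng.uniform(lo, hi)
        vec.append(2.0 * (value - lo) / (hi - lo) - 1.0)
    return vec


def init_linear(rng, n_in, n_out):
    """Random weight matrix with n_out rows and n_in columns (no bias, for brevity)."""
    scale = 1.0 / math.sqrt(n_in)
    return [[rng.uniform(-scale, scale) for _ in range(n_in)] for _ in range(n_out)]


def linear(weights, x):
    """Matrix-vector product."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]


class ConditionedActorCritic:
    """Actor and critic that both receive [observation, morphology] as input."""

    def __init__(self, rng, obs_dim, morph_dim, act_dim, hidden=16):
        in_dim = obs_dim + morph_dim
        self.actor_h = init_linear(rng, in_dim, hidden)
        self.actor_out = init_linear(rng, hidden, act_dim)
        self.critic_h = init_linear(rng, in_dim, hidden)
        self.critic_out = init_linear(rng, hidden, 1)

    def forward(self, obs, morph):
        x = list(obs) + list(morph)  # conditioning by concatenation
        actor_hidden = [math.tanh(h) for h in linear(self.actor_h, x)]
        critic_hidden = [math.tanh(h) for h in linear(self.critic_h, x)]
        action_mean = linear(self.actor_out, actor_hidden)
        value = linear(self.critic_out, critic_hidden)[0]
        return action_mean, value


rng = random.Random(0)
model = ConditionedActorCritic(rng, obs_dim=8, morph_dim=len(MORPH_RANGES), act_dim=12)

obs = [0.0] * 8                   # placeholder observation
morph_a = sample_morphology(rng)  # one sampled morphology
morph_b = sample_morphology(rng)  # a different sampled morphology

action_a, value_a = model.forward(obs, morph_a)
action_b, value_b = model.forward(obs, morph_b)
```

In this sketch the same weights produce different actions and value estimates for `morph_a` and `morph_b` even though the observation is identical. That mirrors how a single morphology-conditioned policy could be queried for a new robot at test time by changing only the morphology input, with no retraining.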

Problem

Research questions and friction points this paper is trying to address.

Overcoming hyperparameter tuning and transfer loss challenges

Enabling generalizable locomotion across robot morphologies

Achieving zero-shot transfer without retraining or fine-tuning

Innovation

Methods, ideas, or system contributions that make the work stand out.

Morphology-conditioned policy with a randomized morphology vector fed to both actor and critic

Single policy generalizes across robot morphologies

Achieves high-speed zero-shot transfer without retraining
