Reference Free Platform Adaptive Locomotion for Quadrupedal Robots using a Dynamics Conditioned Policy

📅 2025-05-21

📈 Citations: 0

✨ Influential: 0

career value

213K/year

🤖 AI Summary

Existing quadrupedal robot controllers rely heavily on predefined reference models, limiting adaptability across robots with diverse morphologies and dynamics. Method: This paper proposes Platform-Adaptive Locomotion (PAL), a deep reinforcement learning–based framework that trains a single policy mapping proprioceptive states and velocity commands to joint targets. PAL introduces a novel morphology-aware implicit dynamics conditioning mechanism—eliminating the need for explicit reference models—and integrates a GRU-based dynamics encoder with a morphology attribute estimator. Contribution/Results: PAL enables zero-shot transfer across unseen robots. On the ANYmal C hardware platform, the morphology-aware variant reduces speed tracking error by 30% compared to temporal encoding alone. Extensive evaluation across multiple simulated platforms demonstrates robust generalization and cross-platform adaptability.

Technology Category

Application Category

📝 Abstract

This article presents Platform Adaptive Locomotion (PAL), a unified control method for quadrupedal robots with different morphologies and dynamics. We leverage deep reinforcement learning to train a single locomotion policy on procedurally generated robots. The policy maps proprioceptive robot state information and base velocity commands into desired joint actuation targets, which are conditioned using a latent embedding of the temporally local system dynamics. We explore two conditioning strategies - one using a GRU-based dynamics encoder and another using a morphology-based property estimator - and show that morphology-aware conditioning outperforms temporal dynamics encoding regarding velocity task tracking for our hardware test on ANYmal C. Our results demonstrate that both approaches achieve robust zero-shot transfer across multiple unseen simulated quadrupeds. Furthermore, we demonstrate the need for careful robot reference modelling during training, enabling us to reduce the velocity tracking error by up to 30% compared to the baseline method. Despite PAL not surpassing the best-performing reference-free controller in all cases, our analysis uncovers critical design choices and informs improvements to the state of the art.

Problem

Research questions and friction points this paper is trying to address.

Develop unified control for diverse quadrupedal robot morphologies

Enable robust zero-shot transfer across unseen simulated quadrupeds

Reduce velocity tracking error via improved reference modeling

Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses deep reinforcement learning for locomotion policy

Employs latent dynamics embedding for joint control

Tests GRU and morphology-based conditioning strategies

🔎 Similar Papers

No similar papers found.

Field AI

Irvine, CA

Applied Scientist II, Reinforcement Learning

Amazon

142,800.00 - 193,200.00 USD annually

N.Reading, MA, USA

Research Scientist Intern, Robotic Control Policy (PhD)