Learning Quadruped Walking from Seconds of Demonstration

📅 2026-03-07
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of learning robust locomotion policies for quadrupedal robots from extremely limited human demonstrations. To this end, the authors propose a novel approach that integrates dynamical systems theory with imitation learning. By explicitly modeling the limit-cycle structure of gaits and the associated Poincaré return map, they introduce a consistency regularization mechanism between the latent representation and action outputs to enforce local dynamic alignment. The method enables offline training of diverse, robust walking policies using only a few seconds of human demonstration, substantially improving data efficiency and generalization. Hardware experiments on a real quadruped platform demonstrate the approach’s effectiveness and deployment feasibility, offering insights into the underlying mechanisms that enable successful few-shot imitation learning for quadrupedal locomotion.

Technology Category

Application Category

📝 Abstract
Quadruped locomotion provides a natural setting for understanding when model-free learning can outperform model-based control design, by exploiting data patterns to bypass the difficulty of optimizing over discrete contacts and the combinatorial explosion of mode changes. We give a principled analysis of why imitation learning with quadrupeds can be inherently effective in a small data regime, based on the structure of its limit cycles, Poincar\'e return maps, and local numerical properties of neural networks. The understanding motivates a new imitation learning method that regulates the alignment between variations in a latent space and those over the output actions. Hardware experiments confirm that a few seconds of demonstration is sufficient to train various locomotion policies from scratch entirely offline with reasonable robustness.
Problem

Research questions and friction points this paper is trying to address.

quadruped locomotion
imitation learning
small data regime
limit cycles
Poincaré return maps
Innovation

Methods, ideas, or system contributions that make the work stand out.

imitation learning
quadruped locomotion
limit cycles
Poincaré return maps
latent space alignment
🔎 Similar Papers
No similar papers found.