Discrete-Time Hybrid Automata Learning: Legged Locomotion Meets Skateboarding

📅 2025-03-03
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of mode-switching identification and control in hybrid dynamical systems—exemplified by quadrupedal robot skateboarding—where conventional methods rely on trajectory segmentation or explicit event-function learning. We propose an end-to-end reinforcement learning framework that eliminates both requirements. Methodologically, we formulate a discrete-time hybrid automaton learning paradigm, integrating online policy-gradient optimization, Beta-distribution-based stochastic policies, a multi-discriminator architecture, and contact-aware motion representations. To our knowledge, this is the first approach to achieve unsupervised joint learning of discrete modes and continuous flows in high-dimensional rigid-body dynamics. Experiments demonstrate successful skateboarding execution in simulation and on real quadrupedal robots. The method significantly improves robustness to nonstationary contact dynamics and cross-scenario generalization. It establishes a novel paradigm for contact-guided complex locomotion control.

Technology Category

Application Category

📝 Abstract
This paper introduces Discrete-time Hybrid Automata Learning (DHAL), a framework using on-policy Reinforcement Learning to identify and execute mode-switching without trajectory segmentation or event function learning. Hybrid dynamical systems, which include continuous flow and discrete mode switching, can model robotics tasks like legged robot locomotion. Model-based methods usually depend on predefined gaits, while model-free approaches lack explicit mode-switching knowledge. Current methods identify discrete modes via segmentation before regressing continuous flow, but learning high-dimensional complex rigid body dynamics without trajectory labels or segmentation is a challenging open problem. Our approach incorporates a beta policy distribution and a multi-critic architecture to model contact-guided motions, exemplified by a challenging quadrupedal robot skateboard task. We validate our method through simulations and real-world tests, demonstrating robust performance in hybrid dynamical systems.
Problem

Research questions and friction points this paper is trying to address.

Autonomous mode-switching in hybrid dynamical systems without trajectory segmentation.
Overcoming reliance on predefined gaits for legged robot locomotion modeling.
Learning high-dimensional rigid body dynamics without trajectory labels or segmentation.
Innovation

Methods, ideas, or system contributions that make the work stand out.

On-policy RL for mode-switching without trajectory segmentation
Beta policy distribution with multi-critic contact modeling
Quadrupedal skateboard task validation in hybrid dynamics
🔎 Similar Papers
No similar papers found.
H
Hang Liu
University of Michigan
Sangli Teng
Sangli Teng
Berkeley AI Research
Roboticscontrol
B
Ben Liu
Southern University of Science and Technology
W
Wei Zhang
Southern University of Science and Technology
Maani Ghaffari
Maani Ghaffari
Assistant Professor, University of Michigan
RoboticsMachine LearningRobot PerceptionAutonomous NavigationComputational Symmetry