SAC-Loco: Safe and Adjustable Compliant Quadrupedal Locomotion

📅 2025-09-27

📈 Citations: 0

✨ Influential: 0

career value

180K/year

🤖 AI Summary

Existing quadrupedal robot control methods lack adaptive, tunable compliance against external disturbances, leading to instability or falls under strong perturbations. This paper proposes a force-sensor-free online compliance modulation framework that integrates tunable-compliance reinforcement learning (RL), a Capture Point (CoP)-based safety policy, and a recoverability prediction network to enable dynamic switching between compliant locomotion and fall recovery. Innovatively, we introduce a teacher-student RL architecture and a neural-network-driven proactive policy-switching mechanism triggered prior to failure. Evaluated in simulation and on physical hardware, the method significantly enhances dynamic stability and motion safety under severe disturbances, enabling smooth recovery and continuous locomotion. To our knowledge, this is the first approach achieving real-time, physics-based-force-feedback-free compliance adaptation and robust policy transfer across perturbation regimes.

Technology Category

Application Category

📝 Abstract

Quadruped robots are designed to achieve agile locomotion by mimicking legged animals. However, existing control methods for quadrupeds often lack one of the key capabilities observed in animals: adaptive and adjustable compliance in response to external disturbances. Most locomotion controllers do not provide tunable compliance and tend to fail under large perturbations. In this work, we propose a switched policy framework for compliant and safe quadruped locomotion. First, we train a force compliant policy with adjustable compliance levels using a teacher student reinforcement learning framework, eliminating the need for explicit force sensing. Next, we develop a safe policy based on the capture point concept to stabilize the robot when the compliant policy fails. Finally, we introduce a recoverability network that predicts the likelihood of failure and switches between the compliant and safe policies. Together, this framework enables quadruped robots to achieve both force compliance and robust safety when subjected to severe external disturbances.

Problem

Research questions and friction points this paper is trying to address.

Achieving adaptive compliance in quadruped robots under disturbances

Providing tunable compliance without explicit force sensing

Ensuring safe locomotion recovery from large external perturbations

Innovation

Methods, ideas, or system contributions that make the work stand out.

Switched policy framework for compliant locomotion

Force compliant policy without explicit force sensing

Recoverability network predicting failure for policy switching

🔎 Similar Papers

Playful DoggyBot: Learning Agile and Precise Quadrupedal Locomotion