SAC-Loco: Safe and Adjustable Compliant Quadrupedal Locomotion

📅 2025-09-27
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing quadrupedal robot control methods lack adaptive, tunable compliance against external disturbances, leading to instability or falls under strong perturbations. This paper proposes a force-sensor-free online compliance modulation framework that integrates tunable-compliance reinforcement learning (RL), a Capture Point (CoP)-based safety policy, and a recoverability prediction network to enable dynamic switching between compliant locomotion and fall recovery. Innovatively, we introduce a teacher-student RL architecture and a neural-network-driven proactive policy-switching mechanism triggered prior to failure. Evaluated in simulation and on physical hardware, the method significantly enhances dynamic stability and motion safety under severe disturbances, enabling smooth recovery and continuous locomotion. To our knowledge, this is the first approach achieving real-time, physics-based-force-feedback-free compliance adaptation and robust policy transfer across perturbation regimes.

Technology Category

Application Category

📝 Abstract
Quadruped robots are designed to achieve agile locomotion by mimicking legged animals. However, existing control methods for quadrupeds often lack one of the key capabilities observed in animals: adaptive and adjustable compliance in response to external disturbances. Most locomotion controllers do not provide tunable compliance and tend to fail under large perturbations. In this work, we propose a switched policy framework for compliant and safe quadruped locomotion. First, we train a force compliant policy with adjustable compliance levels using a teacher student reinforcement learning framework, eliminating the need for explicit force sensing. Next, we develop a safe policy based on the capture point concept to stabilize the robot when the compliant policy fails. Finally, we introduce a recoverability network that predicts the likelihood of failure and switches between the compliant and safe policies. Together, this framework enables quadruped robots to achieve both force compliance and robust safety when subjected to severe external disturbances.
Problem

Research questions and friction points this paper is trying to address.

Achieving adaptive compliance in quadruped robots under disturbances
Providing tunable compliance without explicit force sensing
Ensuring safe locomotion recovery from large external perturbations
Innovation

Methods, ideas, or system contributions that make the work stand out.

Switched policy framework for compliant locomotion
Force compliant policy without explicit force sensing
Recoverability network predicting failure for policy switching
🔎 Similar Papers
No similar papers found.
A
Aoqian Zhang
Department of Electrical and Computer Engineering, National University of Singapore
Z
Zixuan Zhuang
School of Computer Science and Engineering, Sun Yat-sen University
C
Chunzheng Wang
Department of Electrical and Computer Engineering, National University of Singapore
Shuzhi Sam Ge
Shuzhi Sam Ge
Department of Electrical and Computer Engineering, National University of Singapore
F
Fan Shi
Department of Electrical and Computer Engineering, National University of Singapore
Cheng Xiang
Cheng Xiang
National University of Singapore
Control systemscomputer visionmachine learningartificial intelligence