🤖 AI Summary
This work addresses the lack of formal safety guarantees for learning-based quadrupedal robots operating under model uncertainty, perceptual noise, and unstructured contacts. The authors propose a differentiable stochastic safe control framework that, for the first time, integrates variance-aware probabilistic control barrier functions into a differentiable quadratic program. This approach incorporates semantic-driven safety margin modulation and a meta-adaptive risk calibration mechanism, enabling end-to-end trainable safe locomotion while providing theoretical guarantees such as probabilistic forward invariance. Evaluated on Unitree A1 and ANYmal C platforms, the system runs in real time at 200 Hz and, across twelve challenging terrains, reduces safety violations by 3–10× and energy consumption by 10–15% compared to state-of-the-art methods based on control barrier functions, model predictive control, and hybrid reinforcement learning.
📝 Abstract
Learning-based quadruped controllers achieve impressive agility but typically lack formal safety guarantees under model uncertainty, perception noise, and unstructured contact conditions. We introduce SafeMind, a differentiable stochastic safety-control framework that unifies probabilistic Control Barrier Functions with semantic context understanding and meta-adaptive risk calibration. SafeMind explicitly models epistemic and aleatoric uncertainty through a variance-aware barrier constraint embedded in a differentiable quadratic program, thereby preserving gradient flow for end-to-end training. A semantics-to-constraint encoder modulates safety margins using perceptual or language cues, while a meta-adaptive learner continuously adjusts risk sensitivity across environments. We provide theoretical conditions for probabilistic forward invariance, feasibility, and stability under stochastic dynamics. SafeMind is deployed on Unitree A1 and ANYmal C at 200 Hz and validated across 12 terrain types, dynamic obstacles, morphology perturbations, and semantically defined tasks. Experiments show that SafeMind reduces safety violations by 3– $10\times $ and energy consumption by 10–15% relative to state-of-the-art CBF, MPC, and hybrid RL baselines, while maintaining real-time control performance.