SEA-Nav: Efficient Policy Learning for Safe and Agile Quadruped Navigation in Cluttered Environments

📅 2026-03-10
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of enabling quadrupedal robots to achieve safe, agile navigation in densely cluttered environments while maintaining high training efficiency. To this end, the authors propose the SEA-Nav framework, which integrates differentiable control barrier function (CBF)-based safety constraints, an adaptive collision replay mechanism, risk-aware exploration rewards, and kinematic action constraints within a reinforcement learning paradigm. This unified approach effectively balances safety guarantees with exploration efficiency during policy learning. Notably, SEA-Nav achieves, for the first time on a real quadrupedal robot, successful navigation through highly complex obstacle courses after only minutes of training, substantially improving both sample efficiency and deployment safety compared to prior methods.

Technology Category

Application Category

📝 Abstract
Efficiently training quadruped robot navigation in densely cluttered environments remains a significant challenge. Existing methods are either limited by a lack of safety and agility in simple obstacle distributions or suffer from slow locomotion in complex environments, often requiring excessively long training phases. To this end, we propose SEA-Nav (Safe, Efficient, and Agile Navigation), a reinforcement learning framework for quadruped navigation. Within diverse and dense obstacle environments, a differentiable control barrier function (CBF)-based shield constraints the navigation policy to output safe velocity commands. An adaptive collision replay mechanism and hazardous exploration rewards are introduced to increase the probability of learning from critical experiences, guiding efficient exploration and exploitation. Finally, kinematic action constraints are incorporated to ensure safe velocity commands, facilitating successful physical deployment. To the best of our knowledge, this is the first approach that achieves highly challenging quadruped navigation in the real world with minute-level training time.
Problem

Research questions and friction points this paper is trying to address.

quadruped navigation
cluttered environments
safe navigation
agile locomotion
efficient training
Innovation

Methods, ideas, or system contributions that make the work stand out.

differentiable control barrier function
adaptive collision replay
hazardous exploration rewards
kinematic action constraints
quadruped navigation
🔎 Similar Papers
No similar papers found.
Shiyi Chen
Shiyi Chen
Professor, College of Engineering, EIT and SUSTech
fluid mechanicsturbulenceComputational fluid dynamicslattice Boltzmann
M
Mingye Yang
Imperial College London, London, UK
H
Haiyan Mao
Tsinghua University, Beijing, China
J
Jiaqi Zhang
Tsinghua University, Beijing, China
H
Haiyi Liu
Tsinghua University, Beijing, China
S
Shuheng He
Tsinghua University, Beijing, China
Debing Zhang
Debing Zhang
Xiaohongshu
Machine LearningComputer VisionDeep Learning
Z
Zihao Qiu
Tsinghua University, Beijing, China
Chun Zhang
Chun Zhang
Tsinghua University, Beijing Visual Science and Translational Eye Research Institute (BERI)
glaucomastem cellganglion cellophthalmologydevice