High-Speed Vision-Based Flight in Clutter with Safety-Shielded Reinforcement Learning

πŸ“… 2026-02-09
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work addresses the challenges faced by quadrotor drones in high-speed autonomous obstacle avoidance, where traditional modular pipelines suffer from high latency and pure reinforcement learning approaches lack safety guarantees. The authors propose an end-to-end reinforcement learning framework integrated with a model-driven safety mechanism: during training, physics-informed priors shape the reward function, while at deployment, a real-time safety filter projects the policy’s output onto a provably safe action set. This approach uniquely unifies high-speed flight with rigorous safety constraints within an end-to-end learning paradigm. Evaluated in both dense indoor obstacle courses and real-world forest environments, the system achieves stable flight at 7.5 m/s, outperforming conventional planners and existing end-to-end methods in both performance and robustness, while demonstrating exceptional generalization and safety.

Technology Category

Application Category

πŸ“ Abstract
Quadrotor unmanned aerial vehicles (UAVs) are increasingly deployed in complex missions that demand reliable autonomous navigation and robust obstacle avoidance. However, traditional modular pipelines often incur cumulative latency, whereas purely reinforcement learning (RL) approaches typically provide limited formal safety guarantees. To bridge this gap, we propose an end-to-end RL framework augmented with model-based safety mechanisms. We incorporate physical priors in both training and deployment. During training, we design a physics-informed reward structure that provides global navigational guidance. During deployment, we integrate a real-time safety filter that projects the policy outputs onto a provably safe set to enforce strict collision-avoidance constraints. This hybrid architecture reconciles high-speed flight with robust safety assurances. Benchmark evaluations demonstrate that our method outperforms both traditional planners and recent end-to-end obstacle avoidance approaches based on differentiable physics. Extensive experiments demonstrate strong generalization, enabling reliable high-speed navigation in dense clutter and challenging outdoor forest environments at velocities up to 7.5m/s.
Problem

Research questions and friction points this paper is trying to address.

high-speed flight
obstacle avoidance
safety guarantees
quadrotor UAV
cluttered environments
Innovation

Methods, ideas, or system contributions that make the work stand out.

Safety-Shielded Reinforcement Learning
Physics-Informed Reward
Real-Time Safety Filter
End-to-End UAV Navigation
High-Speed Obstacle Avoidance
πŸ”Ž Similar Papers
No similar papers found.
J
Jiarui Zhang
State Key Laboratory of Industrial Control Technology, Zhejiang University, Hangzhou 310027, China; Differential Robotics, Hangzhou 311121, China
C
Chengyong Lei
State Key Laboratory of Industrial Control Technology, Zhejiang University, Hangzhou 310027, China; Differential Robotics, Hangzhou 311121, China
C
Chengjiang Dai
State Key Laboratory of Industrial Control Technology, Zhejiang University, Hangzhou 310027, China; Differential Robotics, Hangzhou 311121, China
Lijie Wang
Lijie Wang
FASTLab, Zhejiang University
Multi-Robot SystemMulti-Sensor FusionRobot Perception
Zhichao Han
Zhichao Han
Zhejiang University, PHD Student
Robotics
Fei Gao
Fei Gao
Associate Professor, Zhejiang University
Aerial RoboticsMotion PlanningAutonomous Navigation