Motion Dynamic RRT based Fluid Field - PPO for Dynamic TF/TA Routing Planning

📅 2024-06-02

🏛️ 2024 IEEE Intelligent Vehicles Symposium (IV)

📈 Citations: 0

✨ Influential: 0

career value

171K/year

🤖 AI Summary

Real-time, long-range, and dynamically constrained terrain-following/terrain-avoidance (TF/TA) path planning for medium-to-large fixed-wing UAVs in complex topography remains challenging due to stringent aerodynamic, kinematic, and obstacle-avoidance requirements—especially without global terrain priors. Method: This paper proposes MD-RRT, a fluid-field modeling approach that uniquely integrates perturbed flow fields with artificial potential fields to reconstruct the state-action space of Proximal Policy Optimization (PPO), enabling end-to-end, dynamically feasible planning without global prior knowledge. The method unifies an improved RRT-based sampling strategy, high-fidelity fixed-wing aircraft dynamics modeling, and the PPO reinforcement learning framework. Results: Validated on real-world digital elevation model (DEM) data, the system generates kilometer-scale, collision-free trajectories in real time while satisfying aerodynamic, kinematic, and obstacle-avoidance constraints. It robustly accomplishes TF/TA missions under severe terrain undulation and dynamic obstacles, significantly overcoming the fundamental trade-off among real-time performance, planning horizon, and constraint compliance inherent in existing approaches.

Technology Category

Application Category

📝 Abstract

Existing local dynamic route planning algorithms, when directly applied to terrain following/terrain avoidance, or dynamic obstacle avoidance for large and medium-sized fixed-wing aircraft, fail to simultaneously meet the requirements of real-time performance, long-distance planning, and the dynamic constraints of large and medium-sized aircraft. To deal with this issue, this paper proposes the Motion Dynamic RRT based Fluid Field - PPO for dynamic TF/TA routing planning. Firstly, the action and state spaces of the proximal policy gradient algorithm are redesigned using disturbance flow fields and artificial potential field algorithms, establishing an aircraft dynamics model, and designing a state transition process based on this model. Additionally, a reward function is designed to encourage strategies for obstacle avoidance, terrain following, terrain avoidance, and safe flight. Experimental results on real DEM data demonstrate that our algorithm can complete long-distance flight tasks through collision-free trajectory planning that complies with dynamic constraints, without the need for prior global planning.

Problem

Research questions and friction points this paper is trying to address.

Flight Path Planning

Obstacle Avoidance

Real-time Adaptation

Innovation

Methods, ideas, or system contributions that make the work stand out.

Dynamic RRT

PPO Algorithm

Complex Terrain Path Planning

🔎 Similar Papers

No similar papers found.