Motion Dynamic RRT based Fluid Field - PPO for Dynamic TF/TA Routing Planning

📅 2024-06-02
🏛️ 2024 IEEE Intelligent Vehicles Symposium (IV)
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Real-time, long-range, and dynamically constrained terrain-following/terrain-avoidance (TF/TA) path planning for medium-to-large fixed-wing UAVs in complex topography remains challenging due to stringent aerodynamic, kinematic, and obstacle-avoidance requirements—especially without global terrain priors. Method: This paper proposes MD-RRT, a fluid-field modeling approach that uniquely integrates perturbed flow fields with artificial potential fields to reconstruct the state-action space of Proximal Policy Optimization (PPO), enabling end-to-end, dynamically feasible planning without global prior knowledge. The method unifies an improved RRT-based sampling strategy, high-fidelity fixed-wing aircraft dynamics modeling, and the PPO reinforcement learning framework. Results: Validated on real-world digital elevation model (DEM) data, the system generates kilometer-scale, collision-free trajectories in real time while satisfying aerodynamic, kinematic, and obstacle-avoidance constraints. It robustly accomplishes TF/TA missions under severe terrain undulation and dynamic obstacles, significantly overcoming the fundamental trade-off among real-time performance, planning horizon, and constraint compliance inherent in existing approaches.

Technology Category

Application Category

📝 Abstract
Existing local dynamic route planning algorithms, when directly applied to terrain following/terrain avoidance, or dynamic obstacle avoidance for large and medium-sized fixed-wing aircraft, fail to simultaneously meet the requirements of real-time performance, long-distance planning, and the dynamic constraints of large and medium-sized aircraft. To deal with this issue, this paper proposes the Motion Dynamic RRT based Fluid Field - PPO for dynamic TF/TA routing planning. Firstly, the action and state spaces of the proximal policy gradient algorithm are redesigned using disturbance flow fields and artificial potential field algorithms, establishing an aircraft dynamics model, and designing a state transition process based on this model. Additionally, a reward function is designed to encourage strategies for obstacle avoidance, terrain following, terrain avoidance, and safe flight. Experimental results on real DEM data demonstrate that our algorithm can complete long-distance flight tasks through collision-free trajectory planning that complies with dynamic constraints, without the need for prior global planning.
Problem

Research questions and friction points this paper is trying to address.

Flight Path Planning
Obstacle Avoidance
Real-time Adaptation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dynamic RRT
PPO Algorithm
Complex Terrain Path Planning
🔎 Similar Papers
No similar papers found.
R
Rongkun Xue
School of Automation Science and Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China
J
Jing Yang
School of Automation Science and Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China
Yuyang Jiang
Yuyang Jiang
HKUST/CAA
Art therapyArt educationArt metaverse
Y
Yiming Feng
School of Automation Science and Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China
Z
Zi Yang
School of Automation Science and Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China