Optimizing UAV Aerial Base Station Flights Using DRL-based Proximal Policy Optimization

📅 2025-04-04
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the challenges of rapid deployment and dynamic 3D coverage of unmanned aerial base stations (UABs) in emergency communications, this paper proposes a deep reinforcement learning-based collaborative trajectory optimization method. The approach integrates proximal policy optimization (PPO) with realistic radio-frequency (RF) channel state information—marking the first such integration for adaptive UAB positioning. It generalizes across diverse user equipment (UE) mobility patterns: static, linear, circular, and hybrid. The method enables real-time response and continuous coverage over large-scale, spatially distributed UEs. Evaluated across five representative mobility scenarios, it achieves >98% area coverage, improves average signal-to-interference-plus-noise ratio (SINR) by 12–27% over greedy and Q-learning baselines, and accelerates policy convergence by 3.5×. These advances significantly enhance timeliness in life-critical rescue operations and robustness of emergency wireless connectivity.

Technology Category

Application Category

📝 Abstract
Unmanned aerial vehicle (UAV)-based base stations offer a promising solution in emergencies where the rapid deployment of cutting-edge networks is crucial for maximizing life-saving potential. Optimizing the strategic positioning of these UAVs is essential for enhancing communication efficiency. This paper introduces an automated reinforcement learning approach that enables UAVs to dynamically interact with their environment and determine optimal configurations. By leveraging the radio signal sensing capabilities of communication networks, our method provides a more realistic perspective, utilizing state-of-the-art algorithm -- proximal policy optimization -- to learn and generalize positioning strategies across diverse user equipment (UE) movement patterns. We evaluate our approach across various UE mobility scenarios, including static, random, linear, circular, and mixed hotspot movements. The numerical results demonstrate the algorithm's adaptability and effectiveness in maintaining comprehensive coverage across all movement patterns.
Problem

Research questions and friction points this paper is trying to address.

Optimizing UAV base station positioning for emergency networks
Enhancing communication efficiency via dynamic UAV-environment interaction
Adapting to diverse user equipment movement patterns effectively
Innovation

Methods, ideas, or system contributions that make the work stand out.

DRL-based Proximal Policy Optimization for UAV positioning
Dynamic interaction with environment for optimal configurations
Adaptive coverage across diverse UE movement patterns
🔎 Similar Papers
No similar papers found.