🤖 AI Summary
To address the challenges of rapid deployment and dynamic 3D coverage of unmanned aerial base stations (UABs) in emergency communications, this paper proposes a deep reinforcement learning-based collaborative trajectory optimization method. The approach integrates proximal policy optimization (PPO) with realistic radio-frequency (RF) channel state information—marking the first such integration for adaptive UAB positioning. It generalizes across diverse user equipment (UE) mobility patterns: static, linear, circular, and hybrid. The method enables real-time response and continuous coverage over large-scale, spatially distributed UEs. Evaluated across five representative mobility scenarios, it achieves >98% area coverage, improves average signal-to-interference-plus-noise ratio (SINR) by 12–27% over greedy and Q-learning baselines, and accelerates policy convergence by 3.5×. These advances significantly enhance timeliness in life-critical rescue operations and robustness of emergency wireless connectivity.
📝 Abstract
Unmanned aerial vehicle (UAV)-based base stations offer a promising solution in emergencies where the rapid deployment of cutting-edge networks is crucial for maximizing life-saving potential. Optimizing the strategic positioning of these UAVs is essential for enhancing communication efficiency. This paper introduces an automated reinforcement learning approach that enables UAVs to dynamically interact with their environment and determine optimal configurations. By leveraging the radio signal sensing capabilities of communication networks, our method provides a more realistic perspective, utilizing state-of-the-art algorithm -- proximal policy optimization -- to learn and generalize positioning strategies across diverse user equipment (UE) movement patterns. We evaluate our approach across various UE mobility scenarios, including static, random, linear, circular, and mixed hotspot movements. The numerical results demonstrate the algorithm's adaptability and effectiveness in maintaining comprehensive coverage across all movement patterns.