UAV-Enabled Data Collection for IoT Networks via Rainbow Learning

📅 2024-09-22
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the joint optimization of unmanned aerial vehicle (UAV) three-dimensional trajectory, MIMO receive beamforming, ground node (GN) scheduling, and transmit power allocation in multi-antenna UAV-assisted IoT data collection, aiming to maximize the sum data collection (SDC). The problem is highly coupled and non-convex, rendering conventional convex optimization techniques inapplicable. To tackle this challenge, we propose a dual-loop optimization-driven deep reinforcement learning (DRL) framework and design an end-to-end fully DRL solution based on Rainbow DQN, enabling efficient coordinated decision-making over a continuous–discrete hybrid action space. Simulation results demonstrate that the proposed approach significantly enhances SDC in dynamic environments, achieving an average gain of 32.7% over benchmark algorithms. It exhibits both high computational efficiency and strong robustness, establishing a novel paradigm for non-convex joint resource optimization in UAV-assisted wireless networks.

Technology Category

Application Category

📝 Abstract
Unmanned aerial vehicles (UAVs) assisted Internet of things (IoT) systems have become an important part of future wireless communications. To achieve higher communication rate, the joint design of UAV trajectory and resource allocation is crucial. This letter considers a scenario where a multi-antenna UAV is dispatched to simultaneously collect data from multiple ground IoT nodes (GNs) within a time interval. To improve the sum data collection (SDC) volume, i.e., the total data volume transmitted by the GNs, the UAV trajectory, the UAV receive beamforming, the scheduling of the GNs, and the transmit power of the GNs are jointly optimized. Since the problem is non-convex and the optimization variables are highly coupled, it is hard to solve using traditional optimization methods. To find a near-optimal solution, a double-loop structured optimization-driven deep reinforcement learning (DRL) algorithm and a fully DRL-based algorithm are proposed to solve the problem effectively. Simulation results verify that the proposed algorithms outperform two benchmarks with significant improvement in SDC volumes.
Problem

Research questions and friction points this paper is trying to address.

Optimize UAV trajectory and resource allocation for IoT data collection
Enhance sum data collection volume from ground IoT nodes
Solve non-convex coupled variables via rainbow learning algorithms
Innovation

Methods, ideas, or system contributions that make the work stand out.

UAV trajectory and resource joint optimization
Double-loop rainbow learning based algorithm
Deep reinforcement learning for IoT data collection
🔎 Similar Papers
No similar papers found.
Y
Yingchao Jiao
Shenzhen Institute of Advanced Technology (SIAT), Chinese Academy of Sciences, Guangdong 518055, China; University of Chinese Academy of Sciences, Beijing 100049, China
X
Xuhui Zhang
Shenzhen Future Network of Intelligence Institute (FNii-Shenzhen), the School of Science and Engineering (SSE), and the Guangdong Provincial Key Laboratory of Future Networks of Intelligence, The Chinese University of Hong Kong, Shenzhen, Guangdong 518172, China
Wenchao Liu
Wenchao Liu
Southern University of Science and Technology
UAV communicationMobile Edge ComputingLow-Altitude Wireless Network
Yinyu Wu
Yinyu Wu
Shenzhen Institute of Advanced Technology (SIAT), Chinese Academy of Sciences, Guangdong 518055, China; University of Chinese Academy of Sciences, Beijing 100049, China
J
Jinke Ren
Shenzhen Future Network of Intelligence Institute (FNii-Shenzhen), the School of Science and Engineering (SSE), and the Guangdong Provincial Key Laboratory of Future Networks of Intelligence, The Chinese University of Hong Kong, Shenzhen, Guangdong 518172, China
Yanyan Shen
Yanyan Shen
Shanghai Jiao Tong University
Data managementData analyticsMachine learning systems
B
Bo Yang
Department of Automation and the Key Laboratory of System Control and Information Processing, Ministry of Education, Shanghai Jiao Tong University, Shanghai 200240, China
Xinping Guan
Xinping Guan
Shanghai Jiao Tong University
Wireless Networks and ApplicationsInternet of ThingsControl and Systems