Hierarchical Task Offloading and Trajectory Optimization in Low-Altitude Intelligent Networks Via Auction and Diffusion-based MARL

📅 2025-12-05
🏛️ IEEE Transactions on Cognitive Communications and Networking
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In Low-Altitude Intelligent Networks (LAINs), energy-constrained UAVs face high latency and low energy efficiency due to dynamically arriving tasks and tightly coupled heterogeneous computing resources. Method: This paper proposes a hierarchical, dual-timescale air-ground cooperative optimization framework: at the macro-timescale, a VCG auction mechanism—energy-efficiency-aware and incentive-compatible—is employed for UAV trajectory allocation; at the micro-timescale, we introduce a novel heterogeneous multi-agent PPO algorithm where a diffusion model is embedded into the Actor network, enabling observation-conditioned denoising to enhance policy diversity and environmental adaptability. Contribution/Results: Formulated as an integer nonlinear program and jointly optimized via MARL, the framework achieves a 92.7% task success rate, significantly improves energy efficiency, and converges over 40% faster than state-of-the-art methods.

Technology Category

Application Category

📝 Abstract
The low-altitude intelligent networks (LAINs) emerge as a promising architecture for delivering low-latency and energy-efficient edge intelligence in dynamic and infrastructure-limited environments. By integrating unmanned aerial vehicles (UAVs), aerial base stations, and terrestrial base stations, LAINs can support mission-critical applications such as disaster response, environmental monitoring, and real-time sensing. However, these systems face key challenges, including energy-constrained UAVs, stochastic task arrivals, and heterogeneous computing resources. To address these issues, we propose an integrated air-ground collaborative network and formulate a time-dependent integer nonlinear programming problem that jointly optimizes UAV trajectory planning and task offloading decisions. The problem is challenging to solve due to temporal coupling among decision variables. Therefore, we design a hierarchical learning framework with two timescales. At the large timescale, a Vickrey-Clarke-Groves auction mechanism enables the energy-aware and incentive-compatible trajectory assignment. At the small timescale, we propose the diffusion-heterogeneous-agent proximal policy optimization, a generative multi-agent reinforcement learning algorithm that embeds latent diffusion models into actor networks. Each UAV samples actions from a Gaussian prior and refines them via observation-conditioned denoising, enhancing adaptability and policy diversity. Extensive simulations show that our framework outperforms baselines in energy efficiency, task success rate, and convergence performance.
Problem

Research questions and friction points this paper is trying to address.

Optimizes UAV trajectory planning and task offloading decisions
Addresses energy constraints and stochastic tasks in low-altitude networks
Solves time-dependent integer nonlinear programming with hierarchical learning
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hierarchical learning with two timescales optimization
Vickrey-Clarke-Groves auction for energy-aware trajectory assignment
Diffusion-heterogeneous-agent proximal policy optimization algorithm
J
Jiahao You
Key Laboratory of Dynamic Cognitive System of Electromagnetic Spectrum Space, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China
Z
Ziye Jia
Key Laboratory of Dynamic Cognitive System of Electromagnetic Spectrum Space, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China, and also with the National Mobile Communications Research Laboratory, Southeast University, Nanjing, Jiangsu, 211111, China
C
Can Cui
Key Laboratory of Dynamic Cognitive System of Electromagnetic Spectrum Space, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China
Chao Dong
Chao Dong
Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
image restorationincluding super-resolutiondenoisingetc.
Qihui Wu
Qihui Wu
Professor, Nanjing University of Aeronautics and Astronautics, Nanjing, China
Cognitive RadioUAV Communications
Z
Zhu Han
University of Houston, Houston, TX 77004 USA, and also with the Department of Computer Science and Engineering, Kyung Hee University, Seoul 446-701, South Korea