🤖 AI Summary
In Low-Altitude Intelligent Networks (LAINs), energy-constrained UAVs face high latency and low energy efficiency due to dynamically arriving tasks and tightly coupled heterogeneous computing resources.
Method: This paper proposes a hierarchical, dual-timescale air-ground cooperative optimization framework: at the macro-timescale, a VCG auction mechanism—energy-efficiency-aware and incentive-compatible—is employed for UAV trajectory allocation; at the micro-timescale, we introduce a novel heterogeneous multi-agent PPO algorithm where a diffusion model is embedded into the Actor network, enabling observation-conditioned denoising to enhance policy diversity and environmental adaptability.
Contribution/Results: The joint trajectory-and-offloading problem is formulated as a time-dependent integer nonlinear program and optimized via multi-agent reinforcement learning; the resulting framework achieves a 92.7% task success rate, significantly improves energy efficiency, and converges over 40% faster than state-of-the-art methods.
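The macro-timescale VCG step can be sketched as a one-trajectory-per-UAV assignment auction: pick the allocation that maximizes total reported utility, then charge each winner the externality it imposes on the others (Clarke pivot rule). This is a toy model under assumptions not in the paper: the `values` matrix of reported energy-efficiency utilities, the brute-force `best` helper, and the single-assignment structure are all illustrative.

```python
from itertools import permutations

def vcg_assign(values):
    """Toy VCG trajectory auction.

    values[i][j]: UAV i's reported utility (e.g. energy-efficiency gain)
    for flying trajectory j. Assumes #trajectories >= #UAVs.
    Returns the welfare-maximizing assignment and Clarke pivot payments.
    """
    n = len(values)
    trajs = range(len(values[0]))

    def best(excluded):
        # Brute-force welfare-optimal assignment over the non-excluded UAVs.
        agents = [i for i in range(n) if i != excluded]
        return max(
            ((sum(values[i][t] for i, t in zip(agents, p)), dict(zip(agents, p)))
             for p in permutations(trajs, len(agents))),
            key=lambda pair: pair[0],
        )

    total, assignment = best(excluded=None)
    payments = {}
    for i in assignment:
        others_without_i, _ = best(excluded=i)      # welfare if UAV i stayed home
        others_with_i = total - values[i][assignment[i]]
        payments[i] = others_without_i - others_with_i  # externality on the rest
    return assignment, payments
```

Truthful bidding is a dominant strategy under this payment rule, which is what makes the mechanism incentive-compatible; the paper's actual formulation additionally folds in energy-efficiency awareness.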
📝 Abstract
Low-altitude intelligent networks (LAINs) are emerging as a promising architecture for delivering low-latency, energy-efficient edge intelligence in dynamic, infrastructure-limited environments. By integrating unmanned aerial vehicles (UAVs), aerial base stations, and terrestrial base stations, LAINs can support mission-critical applications such as disaster response, environmental monitoring, and real-time sensing. However, these systems face key challenges, including energy-constrained UAVs, stochastic task arrivals, and heterogeneous computing resources. To address these issues, we propose an integrated air-ground collaborative network and formulate a time-dependent integer nonlinear programming problem that jointly optimizes UAV trajectory planning and task offloading decisions. The problem is challenging to solve due to temporal coupling among the decision variables. We therefore design a hierarchical learning framework operating on two timescales. At the large timescale, a Vickrey-Clarke-Groves auction mechanism enables energy-aware, incentive-compatible trajectory assignment. At the small timescale, we propose diffusion-heterogeneous-agent proximal policy optimization, a generative multi-agent reinforcement learning algorithm that embeds latent diffusion models into actor networks. Each UAV samples actions from a Gaussian prior and refines them via observation-conditioned denoising, enhancing adaptability and policy diversity. Extensive simulations show that our framework outperforms baselines in energy efficiency, task success rate, and convergence speed.
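The "sample from a Gaussian prior, then refine via observation-conditioned denoising" step inside the actor can be illustrated with a minimal DDPM-style reverse pass. This is a sketch, not the paper's network: `denoise_fn(a, obs, t)` stands in for the learned, observation-conditioned noise predictor, and the linear variance schedule and `tanh` squashing are illustrative assumptions.

```python
import numpy as np

def sample_action(obs, denoise_fn, steps=10, action_dim=4, rng=None):
    """Draw an action from a Gaussian prior and iteratively denoise it,
    conditioning each step on the UAV's local observation `obs`.

    denoise_fn(a_t, obs, t) -> predicted noise (hypothetical interface
    for the diffusion head embedded in the actor network).
    """
    if rng is None:
        rng = np.random.default_rng()
    a = rng.standard_normal(action_dim)        # x_T ~ N(0, I): the Gaussian prior
    betas = np.linspace(1e-4, 0.02, steps)     # fixed variance schedule (assumed)
    alphas = 1.0 - betas
    alpha_bars = np.cumprod(alphas)
    for t in reversed(range(steps)):
        eps_hat = denoise_fn(a, obs, t)        # observation-conditioned noise estimate
        a = (a - betas[t] / np.sqrt(1.0 - alpha_bars[t]) * eps_hat) / np.sqrt(alphas[t])
        if t > 0:                              # re-inject noise except at the final step
            a += np.sqrt(betas[t]) * rng.standard_normal(action_dim)
    return np.tanh(a)                          # squash to a bounded action range
```

Because the final action is a stochastic function of both the prior sample and the observation, the same policy can express multimodal behavior, which is the policy-diversity benefit the abstract refers to.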