Collaborative Intelligence for UAV-Satellite Network Slicing: Towards a Joint QoS-Energy-Fairness MADRL Optimization

📅 2025-12-09
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the challenge of jointly ensuring QoS, energy efficiency, and resource fairness in UAV–satellite collaborative network slicing for 6G non-terrestrial networks, this paper proposes a hierarchical network slicing architecture that jointly optimizes UAV trajectory, transmit power, and spectrum allocation. We formulate a decentralized partially observable Markov decision process (POMDP) model and design a centralized-training-with-decentralized-execution multi-agent reinforcement learning algorithm leveraging multi-head attention mechanisms to enable efficient, adaptive resource coordination under dynamic environmental conditions. Experimental results demonstrate that the proposed method achieves up to a 33% improvement in cumulative reward, a 27.5% gain in energy efficiency, and a 19.8% increase in Jain’s fairness index over baseline approaches—thereby simultaneously enhancing service quality, energy sustainability, and resource allocation fairness.

Technology Category

Application Category

📝 Abstract
Non terrestrial networks are critical for achieving global 6G coverage, yet efficient resource management in aerial and space environments remains challenging due to limited onboard power and dynamic operational conditions. Network slicing offers a promising solution for spectrum optimization in UAV based systems serving heterogeneous service demands. For that, this paper proposes a hierarchical network slicing framework for UAV satellite integrated networks supporting eMBB, URLLC, and mMTC services. Specifically, we formulate a joint optimization of UAV trajectory, transmission power, and spectrum allocation as a decentralized partially observable Markov decision process that ensures quality of service while minimizing energy consumption and maximizing resource fairness. To address the computational intractability and partial observability, we develop a multi agent deep reinforcement learning solution under the centralized training and decentralized execution paradigm. In the proposed system, UAV agents act as distributed actors coordinated by a shared critic operating with multi head attention mechanism at a low Earth orbit satellite. Experimental results then demonstrate that our approach outperforms existing methods by up to 33% in cumulative reward while achieving superior energy efficiency and fairness.
Problem

Research questions and friction points this paper is trying to address.

Optimizes UAV trajectory, power, and spectrum for QoS-energy-fairness trade-offs
Addresses resource management in UAV-satellite networks with limited power and dynamics
Solves decentralized optimization using multi-agent deep reinforcement learning
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hierarchical slicing for UAV-satellite networks
Multi-agent deep reinforcement learning optimization
Shared critic with attention mechanism coordination
🔎 Similar Papers
No similar papers found.
T
Thanh-Dao Nguyen
University of Engineering and Technology, Vietnam National University, Hanoi, Vietnam
N
Ngoc-Tan Nguyen
University of Engineering and Technology, Vietnam National University, Hanoi, Vietnam
T
Thai-Duong Nguyen
University of Engineering and Technology, Vietnam National University, Hanoi, Vietnam
Nguyen Van Huynh
Nguyen Van Huynh
University of Liverpool
CybersecurityWireless NetworkingMobile ComputingEdge IntelligenceMachine Learning
D
Dinh-Hieu Tran
Interdisciplinary Centre for Security Reliability and Trust (SnT), University of Luxembourg, Luxembourg
Symeon Chatzinotas
Symeon Chatzinotas
Full Professor | IEEE Fellow | SIGCOM Head, SnT, University of Luxembourg
Wireless CommunicationsNon-Terrestrial NetworksInternet of Things6GQuantum Communications