Constrained MARL for Coexisting TN-NTN Resource Allocation: Scalability and Flexibility

📅 2026-01-20
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of joint constrained resource allocation in large-scale, highly dynamic coexistence scenarios between terrestrial and non-terrestrial networks. To tackle this problem, the authors propose a cross-segment interference-aware decomposition mechanism and develop a sequential multi-agent reinforcement learning framework. By incorporating randomized training environments to model system dynamics, the approach significantly enhances the adaptability and robustness of the learned policies in real-world conditions. Extensive experiments across various configurations under a full 20 MHz bandwidth demonstrate the superiority of the proposed method: it achieves substantially improved scalability compared to existing solutions and maintains stable performance even in highly dynamic user environments.

Technology Category

Application Category

📝 Abstract
This paper considers the joint TN-NTN constrained resource allocation, where terrestrial base stations and non-terrestrial base stations coexist in the spectrum. We focus on large-scale and practical scenarios characterized by large numbers of transmission channels and users, alongside highly dynamic user behaviors. As common learning solutions fail to address these challenges, we propose a decomposition solution based on the special properties of the cross-segment interference, and then tackle the original problem via solving subproblems in a sequential learning manner. Furthermore, to enhance the flexibility of the learned policies, we design a stochastic training environment that captures the key characteristics of real-world systems. Simulation results tested on the full 20MHz bandwidth with various numerologies show that our solution significantly improves scalability compared to existing solutions and remains robust in highly dynamic scenarios.
Problem

Research questions and friction points this paper is trying to address.

Constrained MARL
TN-NTN coexistence
resource allocation
scalability
dynamic user behavior
Innovation

Methods, ideas, or system contributions that make the work stand out.

Constrained MARL
TN-NTN Coexistence
Scalable Resource Allocation
Sequential Learning
Stochastic Training Environment
🔎 Similar Papers
Cuong Le
Cuong Le
PhD student
human dynamics3D motionmuscle activity
Thang X. Vu
Thang X. Vu
SnT, University of Luxembourg
Wireless communications6GMachine learning for communicationsNon-terrestrial networks
S
S. Andrenacci
SES Satellites, Betzdorf, Luxembourg
S
S. Chatzinotas
Interdisciplinary Centre for Security, Reliability and Trust, University of Luxembourg, Luxembourg