SliceFed: Federated Constrained Multi-Agent DRL for Dynamic Spectrum Slicing in 6G

📅 2026-03-11
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of guaranteeing 1 ms ultra-reliable low-latency communication (URLLC) in dense 6G deployments, where dynamic spectrum slicing must simultaneously contend with non-stationary channels, stringent quality-of-service (QoS) constraints, and data privacy requirements. To this end, the paper proposes SliceFed, a novel framework that uniquely integrates federated learning, constrained multi-agent reinforcement learning, and Lagrangian duality to model spectrum slicing as a constrained Markov decision process (CMDP). SliceFed enables privacy-preserving collaborative policy learning through a proximal policy optimization (PPO)-based primal-dual algorithm combined with federated averaging. Experimental results demonstrate that SliceFed achieves nearly 100% compliance with the 1 ms URLLC latency target in dense multi-cell environments, substantially outperforming heuristic and unconstrained baselines while exhibiting strong robustness to variations in traffic load.

Technology Category

Application Category

📝 Abstract
Dynamic spectrum slicing is a critical enabler for 6G Radio Access Networks (RANs), allowing the coexistence of heterogeneous services. However, optimizing resource allocation in dense, interference-limited deployments remains challenging due to non-stationary channel dynamics, strict Quality-of-Service (QoS) requirements, and the need for data privacy. In this paper, we propose SliceFed, a novel Federated Constrained Multi-Agent Deep Reinforcement Learning (F-MADRL) framework. SliceFed formulates the slicing problem as a Constrained Markov Decision Process (CMDP) where autonomous gNB agents maximize spectral efficiency while explicitly satisfying inter-cell interference budgets and hard ultra-reliable low-latency communication (URLLC) latency deadlines. We employ a Lagrangian primal-dual approach integrated with Proximal Policy Optimization (PPO) to enforce constraints, while Federated Averaging enables collaborative learning without exchanging raw local data. Extensive simulations in a dense multi-cell environment demonstrate that SliceFed converges to a stable, safety-aware policy. Unlike heuristic and unconstrained baselines, SliceFed achieves nearly 100% satisfaction of 1~ms URLLC latency deadlines and exhibits superior robustness to traffic load variations, verifying its potential for reliable and scalable 6G spectrum management.
Problem

Research questions and friction points this paper is trying to address.

dynamic spectrum slicing
6G RAN
QoS constraints
data privacy
inter-cell interference
Innovation

Methods, ideas, or system contributions that make the work stand out.

Federated Constrained Multi-Agent DRL
Dynamic Spectrum Slicing
Constrained Markov Decision Process
URLLC
Federated Averaging
🔎 Similar Papers
No similar papers found.
Hossein Mohammadi
Hossein Mohammadi
Mississippi State University
Wireless CommunicationMachine LearningANNDigital Communication Systems
S
Seyed Bagher Hashemi Natanzi
Department of Electrical and Computer Engineering, Worcester Polytechnic Institute, Worcester, MA, USA
R
Ramak Nassiri
Department of Electrical and Computer Engineering, Mississippi State University, Starkville, MS, USA
J
Jamshid Hassanpour
Department of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA, USA
Bo Tang
Bo Tang
Associate Professor, Worcester Polytechnic Institute
Bio-inspired AIAI SecurityDistributed AIand Cyber-Physical Systems
Vuk Marojevic
Vuk Marojevic
Paul B Jacob Professor, Mississippi State University