🤖 AI Summary
This paper addresses the constrained load balancing problem in communication networks, overcoming the limitations of conventional Join-the-Shortest-Queue (JSQ)-type policies—which ignore practical constraints such as bandwidth capacity and minimum queue utilization. For the first time, it systematically models the dual constraints—action-dependent (e.g., bandwidth allocation) and state-dependent (e.g., lower bound on queue utilization)—as a Constrained Markov Decision Process (CMDP). The proposed scheduling policy is theoretically grounded, guarantees strict satisfaction of all constraints, and integrates dynamic programming with policy optimization techniques. Extensive large-scale simulations demonstrate that the policy achieves 100% constraint compliance while significantly reducing average system occupancy. It attains a superior trade-off between end-to-end latency and resource utilization, thereby bridging theoretical rigor with engineering practicality.
📝 Abstract
Join-the-shortest queue (JSQ) and its variants have often been used in solving load balancing problems. JSQ minimizes the average system occupation, e.g., the customer's system time. In this paper, we extend the load balancing setting to include constraints that may be imposed due to the communication network. In particular, we cast the problem in the framework of constrained MDPs: this permit us to address both action-dependent constraints, such as, e.g, bandwidth constraints, and state-dependent constraints, such as, e.g., minimum queue utilization constraints. Unlike the state-of-the-art approaches in load balancing, our policies satisfy the constraints while delivering favorable results in terms of system occupancy. We derive policies that provably satisfy the constraints and evaluate their performance through extensive simulations.