A Deep RL Approach on Task Placement and Scaling of Edge Resources for Cellular Vehicle-to-Network Service Provisioning

📅 2023-05-16
🏛️ IEEE Transactions on Network and Service Management
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In cellular vehicle-to-network (C-V2N) systems, joint optimization of service task placement and edge resource scaling is NP-hard, tightly coupled, and subject to stringent latency requirements. Method: This paper proposes the first Deep Hybrid Policy Gradient (DHPG) algorithm supporting a hybrid action space—simultaneously handling discrete task deployment decisions and continuous resource scaling control—within an end-to-end, real-time online deep reinforcement learning (DRL) framework. The approach is validated using realistic C-V2N traffic traces. Contribution/Results: Under a strict 99th-percentile end-to-end latency constraint, the method achieves significantly higher resource utilization than state-of-the-art (SOTA) baselines, while maintaining decision-making latency in the millisecond range—demonstrating practical deployability in production C-V2N environments.
📝 Abstract
Cellular-Vehicle-to-Everything (C-V2X) is currently at the forefront of the digital transformation of our society. By enabling vehicles to communicate with each other and with the traffic environment using cellular networks, we redefine transportation, improving road safety and transportation services, increasing efficiency of vehicular traffic flows, and reducing environmental impact. To effectively facilitate the provisioning of Cellular Vehicular-to-Network (C-V2N) services, we tackle the interdependent problems of service task placement and scaling of edge resources. Specifically, we formulate the joint problem and prove that it is not computationally tractable. To address its complexity we propose Deep Hybrid Policy Gradient (DHPG), a new Deep Reinforcement Learning (DRL) approach that operates in hybrid action spaces, enabling holistic decision-making and enhancing overall performance. We evaluated the performance of DHPG using simulations with a real-world C-V2N traffic dataset, comparing it to several state-of-the-art (SoA) solutions. DHPG outperforms these solutions, guaranteeing the $99^{th}$ percentile of C-V2N service delay target, while simultaneously optimizing the utilization of computing resources. Finally, time complexity analysis is conducted to verify that the proposed approach can support real-time C-V2N services.
Problem

Research questions and friction points this paper is trying to address.

Optimizing task placement and edge resource scaling for C-V2N services
Addressing computational intractability in joint service management
Ensuring real-time performance and resource efficiency in C-V2X
Innovation

Methods, ideas, or system contributions that make the work stand out.

Deep RL for task placement and scaling
Hybrid action spaces for decision-making
Optimizes resource use and service delay
🔎 Similar Papers
No similar papers found.
Cyril Shih-Huan Hsu
Cyril Shih-Huan Hsu
Postdoctoral Researcher, University of Amsterdam
Deep LearningReinforcement LearningComputer VisionEvolutionary Computation
J
Jorge Mart'in-P'erez
Departamento de Ingenier'ia de Sistemas Telem'aticos, Universidad Polit'ecnica de Madrid
D
D. D. Vleeschauwer
K
K. Kondepu
L
L. Valcarenghi
X
Xi Li
C
C. Papagianni
University of Amsterdam