Deep Q-Learning Assisted Bandwidth Reservation for Multi-Operator Time-Sensitive Vehicular Networking

📅 2025-10-21
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In multi-operator vehicular networks, existing static or single-objective approaches for bandwidth reservation in safety-critical applications fail to jointly optimize cost-efficiency and resource reliability, while also lacking adaptability to path dynamics and demand–supply uncertainty. Method: This paper proposes a dynamic bandwidth reservation update mechanism based on Double Deep Q-Networks (DDQN), the first to apply DDQN to collaborative bandwidth reservation across multiple operators. It integrates dual objectives—cost minimization and service reliability—via a multi-objective reward function and a carefully designed state representation model, enabling end-to-end policy learning. Contribution/Results: Experiments demonstrate that the proposed method reduces bandwidth cost by 40% compared to greedy strategies and other deep reinforcement learning baselines, while significantly enhancing resource assurance stability and robustness under highly dynamic network conditions.

Technology Category

Application Category

📝 Abstract
Very few available individual bandwidth reservation schemes provide efficient and cost-effective bandwidth reservation that is required for safety-critical and time-sensitive vehicular networked applications. These schemes allow vehicles to make reservation requests for the required resources. Accordingly, a Mobile Network Operator (MNO) can allocate and guarantee bandwidth resources based on these requests. However, due to uncertainty in future reservation time and bandwidth costs, the design of an optimized reservation strategy is challenging. In this article, we propose a novel multi-objective bandwidth reservation update approach with an optimal strategy based on Double Deep Q-Network (DDQN). The key design objectives are to minimize the reservation cost with multiple MNOs and to ensure reliable resource provisioning in uncertain situations by solving scenarios such as underbooked and overbooked reservations along the driving path. The enhancements and advantages of our proposed strategy have been demonstrated through extensive experimental results when compared to other methods like greedy update or other deep reinforcement learning approaches. Our strategy demonstrates a 40% reduction in bandwidth costs across all investigated scenarios and simultaneously resolves uncertain situations in a cost-effective manner.
Problem

Research questions and friction points this paper is trying to address.

Optimizing bandwidth reservation for time-sensitive vehicular networks
Minimizing reservation costs across multiple mobile network operators
Ensuring reliable resource provisioning in uncertain traffic scenarios
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses Double Deep Q-Network for bandwidth reservation
Minimizes reservation costs across multiple network operators
Resolves underbooked and overbooked scenarios cost-effectively
🔎 Similar Papers
No similar papers found.
Abdullah Al-Khatib
Abdullah Al-Khatib
Landshut University of Applied Sciences
Vehicular NetworkIoTNetwork Resource AllocationMachine Learning
A
Albert Gergus
Department of Control Engineering and Information Technology, Budapest University of Technology and Economics, Hungary
M
Muneeb Ul Hassan
School of Information Technology, Deakin University, Australia
A
Abdelmajid Khelil
Institute for Data and Process Science, Landshut University of Applied Sciences, Germany
K
Klaus Mossner
Professorship for Communications Engineering, Technical University Chemnitz, Germany
Holger Timinger
Holger Timinger
Professor, Institute for Data and Process Science