An Overlay Multicast Routing Method Based on Network Situational Aware-ness and Hierarchical Multi-Agent Reinforcement Learning

📅 2026-01-17
🏛️ Electronic Research Archive
📈 Citations: 0
Influential: 0
📄 PDF

career value

223K/year
🤖 AI Summary
This work addresses the limitations of traditional overlay multicast in adapting to dynamic traffic fluctuations and the high complexity, slow convergence, and instability of existing reinforcement learning approaches, which often fail to effectively decouple multi-objective optimization. Leveraging the global network view provided by Software-Defined Networking (SDN), the authors propose a hierarchical multi-agent deep reinforcement learning architecture that decomposes multicast tree construction into two coordinated stages. This design significantly reduces the action space and disentangles competing optimization objectives. Experimental results demonstrate that the proposed method outperforms state-of-the-art solutions in terms of end-to-end delay, bandwidth utilization, and packet loss rate, while also achieving faster convergence, superior scalability, and enhanced routing adaptability and flexibility.

Technology Category

Application Category

📝 Abstract
Compared with IP multicast, Overlay Multicast (OM) offers better compatibility and flexible deployment in heterogeneous, cross-domain networks. However, traditional OM struggles to adapt to dynamic traffic due to unawareness of physical resource states, and existing reinforcement learning methods fail to decouple OM's tightly coupled multi-objective nature, leading to high complexity, slow convergence, and instability. To address this, we propose MA-DHRL-OM, a multi-agent deep hierarchical reinforcement learning approach. Using SDN's global view, it builds a traffic-aware model for OM path planning. The method decomposes OM tree construction into two stages via hierarchical agents, reducing action space and improving convergence stability. Multi-agent collaboration balances multi-objective optimization while enhancing scalability and adaptability. Experiments show MA-DHRL-OM outperforms existing methods in delay, bandwidth utilization, and packet loss, with more stable convergence and flexible routing.
Problem

Research questions and friction points this paper is trying to address.

Overlay Multicast
Network Situational Awareness
Multi-Agent Reinforcement Learning
Multi-Objective Optimization
Dynamic Traffic Adaptation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Overlay Multicast
Hierarchical Multi-Agent Reinforcement Learning
Network Situational Awareness
SDN-based Traffic Awareness
Multi-Objective Optimization
🔎 Similar Papers
No similar papers found.
M
Miao Ye
School of Information and Communication, Guilin University of Electronic Technology, Guilin 541000, China
Y
Yanye Chen
School of Information and Communication, Guilin University of Electronic Technology, Guilin 541000, China
Yong Wang
Yong Wang
Professor of Computer Science, Ocean University of China
Software EngineeringOperational ResearchMachine Learning
Cheng Zhu
Cheng Zhu
J. Erskine Love Jr. Endowed Chair in Engineering and Regents' Professor
BiomechanicsMechanobiologyImmunologyCancerHemostasis and Thrombosis
Q
Qiuxiang Jiang
School of Optoelectronic Engineering, Guilin University of Electronic Technology, Guilin 541000, China
G
Gai Huang
School of Information and Communication, Guilin University of Electronic Technology, Guilin 541000, China
Feng Ding
Feng Ding
Suzhou Laboratory
PhysicsChemistryMaterial Science