VIL2C: Value-of-Information Aware Low-Latency Communication for Multi-Agent Reinforcement Learning

📅 2025-11-24
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address action lag and stale information caused by communication delays in multi-agent reinforcement learning (MARL), this paper proposes an information-value-aware low-latency communication mechanism. The method introduces: (1) a computable Value of Information (VOI) metric that jointly quantifies message timeliness and decision impact; (2) a progressive message reception protocol that dynamically prioritizes high-VOI messages and adaptively adjusts waiting duration; and (3) theoretical guarantees on convergence and bounded end-to-end delay. Experiments in time-critical autonomous driving scenarios demonstrate that the approach reduces latency for high-VOI messages by 37.2% on average, eliminates redundant waiting, and improves both coordination efficiency and MARL policy performance. The method consistently outperforms state-of-the-art baselines across diverse channel conditions.

Technology Category

Application Category

📝 Abstract
Inter-agent communication serves as an effective mechanism for enhancing performance in collaborative multi-agent reinforcement learning(MARL) systems. However, the inherent communication latency in practical systems induces both action decision delays and outdated information sharing, impeding MARL performance gains, particularly in time-critical applications like autonomous driving. In this work, we propose a Value-of-Information aware Low-latency Communication(VIL2C) scheme that proactively adjusts the latency distribution to mitigate its effects in MARL systems. Specifically, we define a Value of Information (VOI) metric to quantify the importance of delayed message transmission based on each delayed message's importance. Moreover, we propose a progressive message reception mechanism to adaptively adjust the reception duration based on received messages. We derive the optimized VoI aware resource allocation and theoretically prove the performance advantage of the proposed VIL2C scheme. Extensive experiments demonstrate that VIL2C outperforms existing approaches under various communication conditions. These gains are attributed to the low-latency transmission of high-VoI messages via resource allocation and the elimination of unnecessary waiting periods via adaptive reception duration.
Problem

Research questions and friction points this paper is trying to address.

Addresses communication latency effects in multi-agent reinforcement learning
Proposes VoI metric to quantify delayed message importance
Optimizes resource allocation for low-latency transmission of critical messages
Innovation

Methods, ideas, or system contributions that make the work stand out.

Proactively adjusts latency distribution to mitigate delays
Defines Value of Information metric for message importance
Uses adaptive reception mechanism to eliminate waiting periods
🔎 Similar Papers
No similar papers found.
Q
Qian Zhang
Northwestern Polytechnical University
Zhuo Sun
Zhuo Sun
Australian National University
Wireless Comunications
Y
Yao Zhang
Northwestern Polytechnical University
Z
Zhiwen Yu
Northwestern Polytechnical University, Harbin Engineering University
B
Bin Guo
Northwestern Polytechnical University
J
Jun Zhang
The Hong Kong University of Science and Technology