Robust Multi-agent Communication Based on Decentralization-Oriented Adversarial Training

📅 2025-04-30
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing multi-agent reinforcement learning (MARL) communication strategies often converge to suboptimal equilibria, resulting in highly concentrated and structurally imbalanced channel usage—where failure of a single critical channel triggers systemic collapse. Method: We propose a decentralized adversarial training framework—the first to incorporate sociological decentralization theory into MARL communication modeling. Our Dynamic Masking Adversarial Channel module (DMAC_Adv) actively identifies and masks overloaded channels during training, steering policies toward balanced, robust, decentralized topologies. The approach integrates differentiable communication architectures with MARL, requiring no centralized coordination. Contribution/Results: Evaluated on four standard MARL benchmarks, our method significantly improves communication robustness and task performance while achieving low communication overhead and high fault tolerance. It enables scalable, resilient decentralized communication topologies without structural fragility or central dependency.

Technology Category

Application Category

📝 Abstract
In typical multi-agent reinforcement learning (MARL) problems, communication is important for agents to share information and make the right decisions. However, due to the complexity of training multi-agent communication, existing methods often fall into the dilemma of local optimization, which leads to the concentration of communication in a limited number of channels and presents an unbalanced structure. Such unbalanced communication policy are vulnerable to abnormal conditions, where the damage of critical communication channels can trigger the crash of the entire system. Inspired by decentralization theory in sociology, we propose DMAC, which enhances the robustness of multi-agent communication policies by retraining them into decentralized patterns. Specifically, we train an adversary DMAC_Adv which can dynamically identify and mask the critical communication channels, and then apply the adversarial samples generated by DMAC_Adv to the adversarial learning of the communication policy to force the policy in exploring other potential communication schemes and transition to a decentralized structure. As a training method to improve robustness, DMAC can be fused with any learnable communication policy algorithm. The experimental results in two communication policies and four multi-agent tasks demonstrate that DMAC achieves higher improvement on robustness and performance of communication policy compared with two state-of-the-art and commonly-used baselines. Also, the results demonstrate that DMAC can achieve decentralized communication structure with acceptable communication cost.
Problem

Research questions and friction points this paper is trying to address.

Enhances robustness in multi-agent communication policies
Prevents over-reliance on limited critical communication channels
Promotes decentralized communication structures for system resilience
Innovation

Methods, ideas, or system contributions that make the work stand out.

Decentralization-oriented adversarial training for robustness
Dynamic critical channel identification and masking
Fusion with any learnable communication policy algorithm
🔎 Similar Papers
No similar papers found.
X
Xuyan Ma
Science and Technology on Integrated Information System Laboratory, Institute of Software Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
Yawen Wang
Yawen Wang
The University of Texas at Arlington
Gear DynamicsNoise and Vibration
J
Junjie Wang
Science and Technology on Integrated Information System Laboratory, Institute of Software Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
Xiaofei Xie
Xiaofei Xie
Singapore Management University
Software EngineeringLoop AnalysisTestingDeep Learning
B
Boyu Wu
Science and Technology on Integrated Information System Laboratory, Institute of Software Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
Shoubin Li
Shoubin Li
Institute of Software, Chinese Academy of Sciences
Knowledge Graph
F
Fanjiang Xu
Science and Technology on Integrated Information System Laboratory, Institute of Software Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
Q
Qing Wang
Science and Technology on Integrated Information System Laboratory, Institute of Software Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China