AC-MASAC: An Attentive Curriculum Learning Framework for Heterogeneous UAV Swarm Coordination

📅 2026-02-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work proposes a multi-agent reinforcement learning framework integrating role-aware heterogeneous attention mechanisms with structured curriculum learning to address key challenges in cooperative path planning for heterogeneous unmanned aerial vehicle (UAV) swarms, including asymmetric agent dependencies, sparse rewards, and catastrophic forgetting. The proposed approach models asymmetric inter-agent dependencies through a tailored attention mechanism and mitigates sparse reward and catastrophic forgetting issues via hierarchical knowledge transfer and a phased proportional experience replay strategy. Experimental results on a custom simulation platform demonstrate that the method significantly outperforms existing approaches in terms of task success rate, formation retention ratio, and weighted task completion time, while substantially improving training stability and collaborative efficiency.

Technology Category

Application Category

📝 Abstract
Cooperative path planning for heterogeneous UAV swarms poses significant challenges for Multi-Agent Reinforcement Learning (MARL), particularly in handling asymmetric inter-agent dependencies and addressing the risks of sparse rewards and catastrophic forgetting during training. To address these issues, this paper proposes an attentive curriculum learning framework (AC-MASAC). The framework introduces a role-aware heterogeneous attention mechanism to explicitly model asymmetric dependencies. Moreover, a structured curriculum strategy is designed, integrating hierarchical knowledge transfer and stage-proportional experience replay to address the issues of sparse rewards and catastrophic forgetting. The proposed framework is validated on a custom multi-agent simulation platform, and the results show that our method has significant advantages over other advanced methods in terms of Success Rate, Formation Keeping Rate, and Success-weighted Mission Time. The code is available at \textcolor{red}{https://github.com/Wanhao-Liu/AC-MASAC}.
Problem

Research questions and friction points this paper is trying to address.

heterogeneous UAV swarm
asymmetric inter-agent dependencies
sparse rewards
catastrophic forgetting
cooperative path planning
Innovation

Methods, ideas, or system contributions that make the work stand out.

attentive curriculum learning
heterogeneous UAV swarm
asymmetric dependencies
catastrophic forgetting
multi-agent reinforcement learning
🔎 Similar Papers
No similar papers found.
Wanhao Liu
Wanhao Liu
University of Science and Technology of China
AI for Science
J
Junhong Dai
Guangdong University of Technology
Y
Yixuan Zhang
Guangdong University of Technology
S
Shengyun Yin
Guangdong University of Technology
Panshuo Li
Panshuo Li
Guangdong University of Technology
switched systemstimg-varying systemsnetworked control systemsintelligent vehicls