🤖 AI Summary
This work proposes a multi-agent reinforcement learning framework integrating role-aware heterogeneous attention mechanisms with structured curriculum learning to address key challenges in cooperative path planning for heterogeneous unmanned aerial vehicle (UAV) swarms, including asymmetric agent dependencies, sparse rewards, and catastrophic forgetting. The proposed approach models asymmetric inter-agent dependencies through a tailored attention mechanism and mitigates sparse reward and catastrophic forgetting issues via hierarchical knowledge transfer and a phased proportional experience replay strategy. Experimental results on a custom simulation platform demonstrate that the method significantly outperforms existing approaches in terms of task success rate, formation retention ratio, and weighted task completion time, while substantially improving training stability and collaborative efficiency.
📝 Abstract
Cooperative path planning for heterogeneous UAV swarms poses significant challenges for Multi-Agent Reinforcement Learning (MARL), particularly in handling asymmetric inter-agent dependencies and addressing the risks of sparse rewards and catastrophic forgetting during training. To address these issues, this paper proposes an attentive curriculum learning framework (AC-MASAC). The framework introduces a role-aware heterogeneous attention mechanism to explicitly model asymmetric dependencies. Moreover, a structured curriculum strategy is designed, integrating hierarchical knowledge transfer and stage-proportional experience replay to address the issues of sparse rewards and catastrophic forgetting. The proposed framework is validated on a custom multi-agent simulation platform, and the results show that our method has significant advantages over other advanced methods in terms of Success Rate, Formation Keeping Rate, and Success-weighted Mission Time. The code is available at \textcolor{red}{https://github.com/Wanhao-Liu/AC-MASAC}.