🤖 AI Summary
Addressing the challenge of balancing cooperative efficiency and communication stealth in underwater multi-AUV collaborative target detection, this paper proposes a hierarchical action management framework. At the macro level, task allocation is formulated as a centralized Markov Decision Process (MDP); at the micro level, trajectory planning and transmission power control are jointly optimized via a decentralized Partially Observable MDP (POMDP). The framework supports centralized training with decentralized execution, integrating Proximal Policy Optimization (PPO) and Multi-Agent PPO, while incorporating realistic underwater acoustic channel modeling, dynamic energy consumption analysis, and adaptive trajectory design. Experimental results demonstrate that, under strict energy and maneuverability constraints, the method significantly improves both covert communication success rate and target detection accuracy. This provides a deployable solution for intelligent, adversarial underwater multi-agent coordination.
📝 Abstract
This paper investigates underwater cooperative target detection using autonomous underwater vehicles (AUVs), with a focus on the critical trade-off between cooperation efficiency and communication covertness. To tackle this challenge, we first formulate a joint trajectory and power control optimization problem, and then present an innovative hierarchical action management framework to solve it. According to the hierarchical formulation, at the macro level, the master AUV models the agent selection process as a Markov decision process and deploys the proximal policy optimization algorithm for strategic task allocation. At the micro level, each selected agent's decentralized decision-making is modeled as a partially observable Markov decision process, and a multi-agent proximal policy optimization algorithm is used to dynamically adjust its trajectory and transmission power based on its local observations. Under the centralized training and decentralized execution paradigm, our target detection framework enables adaptive covert cooperation while satisfying both energy and mobility constraints. By comprehensively modeling the considered system, the involved signals and tasks, as well as energy consumption, theoretical insights and practical solutions for the efficient and secure operation of multiple AUVs are provided, offering significant implications for the execution of underwater covert communication tasks.