Agentic AI Empowered Multi-UAV Trajectory Optimization in Low-Altitude Economy Networks

πŸ“… 2025-08-22
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
To address insufficient autonomy and adaptability in multi-UAV cooperative trajectory optimization for low-altitude economic networks, this paper proposes ARMAITβ€”a unified end-to-end framework. First, it employs Agentic Retrieval-Augmented Generation (Agentic RAG) to autonomously parse task requirements. Second, it introduces a Mamba-Attention hybrid architecture (MAIT) that jointly optimizes long-range dependency modeling efficiency and fine-grained local feature capture. Third, it formulates Trajectory Group Relative Policy Optimization (T-GRPO), a novel algorithm unifying discrete task assignment and continuous trajectory control within a single joint policy space. Experiments across diverse-scale multi-UAV systems demonstrate that ARMAIT significantly improves planning efficiency, robustness, and generalization capability compared to state-of-the-art baselines. The framework establishes a scalable, interpretable, and end-to-end decision-making paradigm for low-altitude intelligent traffic management.

Technology Category

Application Category

πŸ“ Abstract
This paper proposes a novel Agentic Retrieval-augmented generation with Mamba-Attention Integrated Transformer (ARMAIT) framework for multi-Unmanned Aerial Vehicle (UAV) trajectory optimization. The framework is built upon Large Language Models (LLMs), incorporating Retrieval-Augmented Generation (RAG) empowered by Agentic AI and integrated with a UAV-specific knowledge base. Through the Agentic RAG, the LLM autonomously interprets high-level task requirements and identifies the key components necessary for trajectory optimization, including model inputs and outputs, network architecture, reward functions, and task constraints. To support efficient modeling across different system scales, we introduce the Mamba-Attention Integrated Transformer (MAIT), a hybrid neural architecture that combines the long-range dependency modeling capability of attention mechanisms with the efficient temporal dynamic representation of Mamba. Furthermore, a Trajectory-Group Relative Policy Optimization (T-GRPO) method is proposed to achieve unified policy gradient optimization in both discrete and continuous trajectory spaces for MAIT training. Extensive experimental results validate the feasibility and effectiveness of the proposed ARMAIT framework.
Problem

Research questions and friction points this paper is trying to address.

Optimizing multi-UAV trajectories in low-altitude economy networks
Autonomous interpretation of high-level task requirements via Agentic AI
Unified policy optimization across discrete and continuous trajectory spaces
Innovation

Methods, ideas, or system contributions that make the work stand out.

Agentic RAG framework with UAV knowledge base
Hybrid Mamba-Attention Transformer for efficient modeling
Trajectory-Group Relative Policy Optimization method
πŸ”Ž Similar Papers
No similar papers found.
F
Feibo Jiang
Hunan Provincial Key Laboratory of Intelligent Computing and Language Information Processing, Hunan Normal University, Changsha, China
L
Li Dong
Key Laboratory of Hunan Province for New Retail Virtual Reality Technology, Hunan University of Technology and Business, Changsha, China
X
Xitao Pan
Hunan Provincial Key Laboratory of Intelligent Computing and Language Information Processing, Hunan Normal University, Changsha, China
Kezhi Wang
Kezhi Wang
Professor, Royal Society Industry Fellow, Brunel University London
Wireless CommunicationEdge ComputingMachine Learning
Cunhua Pan
Cunhua Pan
Professor, Southeast University
RISUAVISACURLLC