UAVs Meet LLMs: Overviews and Perspectives Toward Agentic Low-Altitude Mobility

📅 2025-01-04
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the limited autonomous decision-making and execution capabilities of unmanned aerial vehicles (UAVs) in complex environments, this paper proposes the novel paradigm of “embodied low-altitude agents,” establishing an embodied intelligence framework integrating UAVs with large language models (LLMs). Methodologically, we design a multimodal data taxonomy and task-scenario mapping framework; integrate vision/IMU/GNSS perception, LLM-based instruction understanding and hierarchical planning, tool invocation (via APIs and flight-control interfaces), memory-augmented reasoning, and simulation-to-real co-training; and construct a domain-specific multimodal data resource atlas covering 12 representative low-altitude tasks. Key contributions include: (1) the first systematic formalization of the embodied low-altitude agent concept; (2) release of an open-source technology roadmap; and (3) proposal of a scalable Agentic UAV reference architecture, validated through prototypes in logistics and inspection scenarios—demonstrating significant improvements in task comprehension, dynamic adaptability, and autonomous execution.

Technology Category

Application Category

📝 Abstract
Low-altitude mobility, exemplified by unmanned aerial vehicles (UAVs), has introduced transformative advancements across various domains, like transportation, logistics, and agriculture. Leveraging flexible perspectives and rapid maneuverability, UAVs extend traditional systems' perception and action capabilities, garnering widespread attention from academia and industry. However, current UAV operations primarily depend on human control, with only limited autonomy in simple scenarios, and lack the intelligence and adaptability needed for more complex environments and tasks. The emergence of large language models (LLMs) demonstrates remarkable problem-solving and generalization capabilities, offering a promising pathway for advancing UAV intelligence. This paper explores the integration of LLMs and UAVs, beginning with an overview of UAV systems' fundamental components and functionalities, followed by an overview of the state-of-the-art in LLM technology. Subsequently, it systematically highlights the multimodal data resources available for UAVs, which provide critical support for training and evaluation. Furthermore, it categorizes and analyzes key tasks and application scenarios where UAVs and LLMs converge. Finally, a reference roadmap towards agentic UAVs is proposed, aiming to enable UAVs to achieve agentic intelligence through autonomous perception, memory, reasoning, and tool utilization. Related resources are available at https://github.com/Hub-Tian/UAVs_Meet_LLMs.
Problem

Research questions and friction points this paper is trying to address.

Autonomous Decision-making
Artificial Intelligence
Unmanned Aerial Vehicles
Innovation

Methods, ideas, or system contributions that make the work stand out.

Large Language Models
Drone Autonomy
Advanced Cognitive Functions
🔎 Similar Papers
No similar papers found.
Yonglin Tian
Yonglin Tian
Institute of Automation, Chinese Academy of Sciences
Parallel intelligenceParallel umanned systemsIntelligent vehiclesAutonomous driving
Fei Lin
Fei Lin
Macau University of Science and Technology
Parallel IntelligenceLarge Language ModelEmbodied AgentAI4Science
Y
Yiduo Li
Department of Engineering Science, Faculty of Innovation Engineering, Macau University of Science and Technology, Macau, 999078, China
T
Tengchao Zhang
Department of Engineering Science, Faculty of Innovation Engineering, Macau University of Science and Technology, Macau, 999078, China
Q
Qiyao Zhang
School of Automation, Beijing Institute of Technology, Beijing, 100081, China
X
Xuan Fu
Department of Engineering Science, Faculty of Innovation Engineering, Macau University of Science and Technology, Macau, 999078, China
J
Jun Huang
Department of Engineering Science, Faculty of Innovation Engineering, Macau University of Science and Technology, Macau, 999078, China
Xingyuan Dai
Xingyuan Dai
Institute of Automation, Chinese Academy of Sciences
Artificial IntelligenceParallel IntelligenceReinforcement LearningITS
Y
Yutong Wang
The State Key Laboratory of Multimodal Artificial Intelligence Systems, Institute of Automation, Chinese Academy of Sciences, Beijing, 100190, China
C
Chunwei Tian
School of Software, Northwestern Polytechnical University, Xi’an, 710129, China
B
Bai Li
College of Mechanical and Vehicle Engineering, Hunan University, Changsha, 410082, China
Yisheng Lv
Yisheng Lv
The University of Chinese Academy of Sciences, and Chinese Academy of Sciences
Parallel IntelligenceAI for TransportationAutonomous VehiclesParallel Transportation Systems
L
Levente Kov'acs
John von Neumann Faculty of Informatics, Obuda University, Budapest, H-1034, Hungary
Fei-Yue Wang
Fei-Yue Wang
Professor, Formerly The University of Arizona, Currently Chinese Academy of Sciences
Intelligent SystemsIntelligent VehiclesRobotics and AutomationBlockchainDAO