๐ค AI Summary
This work addresses the challenge of reliably translating natural language instructions from non-expert users into safe, spatiotemporally constrained trajectories for urban low-altitude drones. The authors propose a unified framework that first leverages a large language model (LLM) enhanced with chain-of-thought supervision and group-relative policy optimization to translate natural language into Signal Temporal Logic (STL) specifications with high grammatical validity and semantic consistency. Feasible trajectories are then synthesized via mixed-integer linear programming (MILP). Crucially, the system introduces a novel specification repair mechanism that dynamically refines infeasible STL constraints by integrating MILP-based diagnostics with LLM-driven semantic reasoning, ensuring safety. Simulations and real-world flight experiments demonstrate that this closed-loop approach significantly enhances the robustness of natural languageโtoโSTL translation, enabling safe, interpretable, and adaptive autonomous navigation in complex urban environments.
๐ Abstract
Natural language (NL) navigation for low-altitude unmanned aerial vehicles (UAVs) offers an intelligent and convenient solution for low-altitude aerial services by enabling an intuitive interface for non-expert operators. However, deploying this capability in urban environments necessitates the precise grounding of underspecified instructions into safety-critical, dynamically feasible motion plans subject to spatiotemporal constraints. To address this challenge, we propose a unified framework that translates NL instructions into Signal Temporal Logic (STL) specifications and subsequently synthesizes trajectories via mixed-integer linear programming (MILP). Specifically, to generate executable STL formulas from free-form NL, we develop a reasoning-enhanced large language model (LLM) leveraging chain-of-thought (CoT) supervision and group-relative policy optimization (GRPO), which ensures high syntactic validity and semantic consistency. Furthermore, to resolve infeasibilities induced by stringent logical or spatial requirements, we introduce a specification repair mechanism. This module combines MILP-based diagnosis with LLM-guided semantic reasoning to selectively relax task constraints while strictly enforcing safety guarantees. Extensive simulations and real-world flight experiments demonstrate that the proposed closed-loop framework significantly improves NL-to-STL translation robustness, enabling safe, interpretable, and adaptable UAV navigation in complex scenarios.