LAA3D: A Benchmark of Detecting and Tracking Low-Altitude Aircraft in 3D Space

πŸ“… 2025-11-24
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
To address the lack of dedicated benchmarks for three-dimensional (3D) perception of low-altitude aircraft (LAAs), this work introduces LAA3Dβ€”the first large-scale, multi-scenario 3D perception dataset for LAAs, comprising 15,000 real-world images and 600,000 high-fidelity synthetic frames. It supports three core tasks: 3D object detection, multi-object tracking (MOT), and 6-degree-of-freedom (6-DoF) pose estimation. We propose the first unified multi-task evaluation benchmark tailored to LAAs and design MonoLAA, a monocular 3D detector incorporating zoom-camera multi-focal-length inputs as a baseline. Experiments demonstrate that pretraining on synthetic data significantly enhances performance on real-world data, enabling strong simulation-to-reality (sim-to-real) transfer. The baseline achieves robust 3D localization and cross-domain generalization on real data. This work establishes a foundational dataset and a methodological paradigm for intelligent low-altitude perception research.

Technology Category

Application Category

πŸ“ Abstract
Perception of Low-Altitude Aircraft (LAA) in 3D space enables precise 3D object localization and behavior understanding. However, datasets tailored for 3D LAA perception remain scarce. To address this gap, we present LAA3D, a large-scale dataset designed to advance 3D detection and tracking of low-altitude aerial vehicles. LAA3D contains 15,000 real images and 600,000 synthetic frames, captured across diverse scenarios, including urban and suburban environments. It covers multiple aerial object categories, including electric Vertical Take-Off and Landing (eVTOL) aircraft, Micro Aerial Vehicles (MAVs), and Helicopters. Each instance is annotated with 3D bounding box, class label, and instance identity, supporting tasks such as 3D object detection, 3D multi-object tracking (MOT), and 6-DoF pose estimation. Besides, we establish the LAA3D Benchmark, integrating multiple tasks and methods with unified evaluation protocols for comparison. Furthermore, we propose MonoLAA, a monocular 3D detection baseline, achieving robust 3D localization from zoom cameras with varying focal lengths. Models pretrained on synthetic images transfer effectively to real-world data with fine-tuning, demonstrating strong sim-to-real generalization. Our LAA3D provides a comprehensive foundation for future research in low-altitude 3D object perception.
Problem

Research questions and friction points this paper is trying to address.

Addressing the scarcity of 3D datasets for low-altitude aircraft perception
Advancing 3D detection and tracking of aerial vehicles in diverse environments
Establishing a benchmark with unified evaluation protocols for comparison
Innovation

Methods, ideas, or system contributions that make the work stand out.

Large-scale dataset for 3D aircraft detection
Monocular baseline method for robust 3D localization
Synthetic-to-real transfer with fine-tuning generalization
πŸ”Ž Similar Papers
No similar papers found.
Hai Wu
Hai Wu
The University of Hong Kong
Shuai Tang
Shuai Tang
South China University of Technology, Pengcheng Laboratory
Jiale Wang
Jiale Wang
HKUST, BUPT
Medical robots
L
Longkun Zou
Pengcheng Laboratory
M
Mingyue Guo
Pengcheng Laboratory
R
Rongqin Liang
Pengcheng Laboratory
K
Ke Chen
Pengcheng Laboratory
Yaowei Wang
Yaowei Wang
The Hong Kong Polytechnic University