Towards UAV Detection in the Real World: A New Multispectral Dataset UAVNet-MS and a New Method

📅 2026-05-20
📈 Citations: 0
Influential: 0
📄 PDF

career value

211K/year
🤖 AI Summary
Existing RGB-based drone detection methods suffer significant performance degradation in complex scenarios involving small-scale targets, low contrast, and background clutter, and there is a notable absence of multispectral datasets tailored for fine-grained micro-drone detection. To address these challenges, this work introduces UAVNet-MS, the first synchronized RGB–multispectral drone detection dataset comprising 15,618 aligned image pairs, and proposes MFDNet, a dual-stream network that effectively integrates material-aware spectral information through disparity correction and spatial–spectral feature fusion. Experimental results demonstrate that MFDNet outperforms the best RGB-only method by 6.2% in AP50, establishing a foundational dataset, a strong baseline model, and a public evaluation benchmark for multispectral drone monitoring.
📝 Abstract
The proliferation of unmanned aerial vehicles (UAVs) has created urgent demand for precise UAV monitoring. Existing RGB-based systems rely on spatial cues that degrade at small scales, particularly with high inter-type similarity, target-clutter ambiguity, and low contrast. Multispectral imaging (MSI) encodes material-aware spectral signatures, yet MSI-based fine-grained small-UAV detection remains underexplored due to lack of dedicated datasets. We introduce UAVNet-MS, the first multispectral dataset for fine-grained small-UAV detection, comprising 15,618 temporally synchronized RGB-MSI data cubes (1440x1080) with bounding box annotations. The dataset features challenging small objects (93.7% <= 32^2 pixels, average 18^2 pixels, ~0.02% image area) under low contrast. We propose MFDNet, a dual-stream baseline addressing array-induced parallax and spatial-spectral fusion. Extensive evaluation under RGB-only, MSI-only, and RGB+MSI protocols against 20 detectors shows MFDNet achieves +6.2% AP50 improvement over best RGB-only methods, demonstrating spectral cues provide complementary material evidence beyond spatial cues. This work provides foundational dataset, strong baseline, and benchmark for multispectral UAV monitoring research.
Problem

Research questions and friction points this paper is trying to address.

UAV detection
multispectral imaging
small object detection
fine-grained recognition
low contrast
Innovation

Methods, ideas, or system contributions that make the work stand out.

multispectral imaging
small UAV detection
UAVNet-MS
spatial-spectral fusion
MFDNet
🔎 Similar Papers
No similar papers found.
Yihang Luo
Yihang Luo
Nanyang Technological University
Artificial IntelligenceComputer Vision
J
Jun Chen
College of Electronic Science and Technology, National University of Defense Technology, Changsha 410073, China
C
Chao Xiao
College of Electronic Science and Technology, National University of Defense Technology, Changsha 410073, China
Yingqian Wang
Yingqian Wang
National University of Defense Technology
light fieldimage super-resolution
Z
Zhaoxu Li
College of Electronic Science and Technology, National University of Defense Technology, Changsha 410073, China
Qiang Ling
Qiang Ling
University of Science and Technology of China
Networked control systemsImage processing
X
Xu He
College of Electronic Science and Technology, National University of Defense Technology, Changsha 410073, China
N
Nuo Chen
College of Electronic Science and Technology, National University of Defense Technology, Changsha 410073, China
G
Gaowei Guo
College of Electronic Science and Technology, National University of Defense Technology, Changsha 410073, China
H
Hongge Li
College of Electronic Science and Technology, National University of Defense Technology, Changsha 410073, China
Miao Li
Miao Li
Georgia institute of technology
general machine learninguncertainty quantificationgenerative modelsrloptimization proxies
Longguang Wang
Longguang Wang
NUDT
low-level vision3D visiondeep learning
Yulan Guo
Yulan Guo
Professor, Sun Yat-sen University
3D VisionMachine LearningRobotics
L
Li Liu
College of Electronic Science and Technology, National University of Defense Technology, Changsha 410073, China
W
Wei An
College of Electronic Science and Technology, National University of Defense Technology, Changsha 410073, China
Z
Zhijie Chen
College of Electronic Science and Technology, National University of Defense Technology, Changsha 410073, China