MUST: The First Dataset and Unified Framework for Multispectral UAV Single Object Tracking

📅 2025-03-22
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the limitations of RGB-based trackers in single-object UAV tracking—particularly under severe occlusion and small target scales—this paper introduces MUST, the first large-scale multispectral UAV tracking benchmark comprising 250 sequences, thereby filling a critical gap in evaluation resources. We propose UNTrack, a novel framework that jointly models spectral, spatial, and temporal cues: it employs an asymmetric Transformer architecture integrated with a spectral background suppression mechanism and a dynamically updated spectral prompt encoder to enable adaptive cross-modal feature fusion. Evaluated on MUST, UNTrack significantly outperforms both RGB-only and state-of-the-art multispectral trackers in accuracy while maintaining superior computational efficiency. To foster reproducibility and community advancement, both the codebase and the MUST dataset are publicly released.

Technology Category

Application Category

📝 Abstract
UAV tracking faces significant challenges in real-world scenarios, such as small-size targets and occlusions, which limit the performance of RGB-based trackers. Multispectral images (MSI), which capture additional spectral information, offer a promising solution to these challenges. However, progress in this field has been hindered by the lack of relevant datasets. To address this gap, we introduce the first large-scale Multispectral UAV Single Object Tracking dataset (MUST), which includes 250 video sequences spanning diverse environments and challenges, providing a comprehensive data foundation for multispectral UAV tracking. We also propose a novel tracking framework, UNTrack, which encodes unified spectral, spatial, and temporal features from spectrum prompts, initial templates, and sequential searches. UNTrack employs an asymmetric transformer with a spectral background eliminate mechanism for optimal relationship modeling and an encoder that continuously updates the spectrum prompt to refine tracking, improving both accuracy and efficiency. Extensive experiments show that our proposed UNTrack outperforms state-of-the-art UAV trackers. We believe our dataset and framework will drive future research in this area. The dataset is available on https://github.com/q2479036243/MUST-Multispectral-UAV-Single-Object-Tracking.
Problem

Research questions and friction points this paper is trying to address.

Addresses lack of multispectral UAV tracking datasets
Proposes unified framework for spectral-spatial-temporal feature encoding
Improves tracking accuracy in small-target occlusion scenarios
Innovation

Methods, ideas, or system contributions that make the work stand out.

Introduces MUST: first multispectral UAV tracking dataset
Proposes UNTrack: unified spectral-spatial-temporal feature framework
Employs asymmetric transformer with spectral background elimination
🔎 Similar Papers
No similar papers found.
Haolin Qin
Haolin Qin
Beijing Institute of Technology
T
Tingfa Xu
Beijing Institute of Technology
T
Tianhao Li
Beijing Institute of Technology
Zhenxiang Chen
Zhenxiang Chen
University of Jinan
Cyber IntelligenceNetwork Behavior AnalysisMachine LearningCyber Security
T
Tao Feng
Beijing Institute of Technology
J
Jianan Li
Beijing Institute of Technology