Dual-branch Distilled Transformer for Efficient Asymmetric UAV Tracking

πŸ“… 2026-05-27
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work addresses the limited performance of lightweight drone tracking models in complex scenarios, which stems from insufficient feature representation due to backbone simplification. To overcome this, we propose EATrack, a novel framework featuring a teacher-guided, spatially focused dual-branch distillation mechanism that integrates both feature-level and prediction-level knowledge transfer. EATrack further incorporates a fine-grained target-aware attention module and a temporal adaptive inference strategy to enhance the student model’s discriminative power and robustness. Extensive experiments on five UAV tracking benchmarks demonstrate that EATrack significantly outperforms existing lightweight trackers while maintaining efficient inference, achieving an optimal balance between accuracy and speed.
πŸ“ Abstract
Given the real-time demands of UAV tracking, many methods simplify the backbone to reduce computation, but this often weakens feature representation and degrades performance in complex scenarios. To alleviate this issue, we propose EATrack, an efficient and asymmetric UAV tracking framework centered around a teacher-guided dual-branch distillation strategy that enhances the feature expressiveness of the lightweight student model. Specifically, EATrack investigates two complementary perspectives of knowledge transfer: spatially focused feature-level distillation that compensates for weakened representations by guiding the student to learn strong target representations, and prediction-level distillation that enhances spatial localization by learning the teacher's capability for accurate target localization. Furthermore, to enhance robustness against appearance variations, we introduce a fine-grained target-aware distillation strategy that selectively transfers the teacher's target modeling capacity to the student. A temporal adaptation module is incorporated at inference to enhance robustness over time. Experiments on five UAV benchmarks demonstrate that EATrack achieves a favorable balance between accuracy and speed. Code: https://github.com/GXNU-ZhongLab/EATrack
Problem

Research questions and friction points this paper is trying to address.

UAV tracking
feature representation
real-time performance
model simplification
tracking accuracy
Innovation

Methods, ideas, or system contributions that make the work stand out.

dual-branch distillation
asymmetric tracking
feature-level distillation
prediction-level distillation
temporal adaptation
H
Hongtao Yang
Key Lab of Education Blockchain and Intelligent Technology, Ministry of Education, Guangxi Normal University, Guilin, 541004, China; Guangxi Key Lab of Multi-Source Information Mining and Security, Guangxi Normal University, Guilin, 541004, China
B
Bineng Zhong
Key Lab of Education Blockchain and Intelligent Technology, Ministry of Education, Guangxi Normal University, Guilin, 541004, China; Guangxi Key Lab of Multi-Source Information Mining and Security, Guangxi Normal University, Guilin, 541004, China
Q
Qihua Liang
Key Lab of Education Blockchain and Intelligent Technology, Ministry of Education, Guangxi Normal University, Guilin, 541004, China; Guangxi Key Lab of Multi-Source Information Mining and Security, Guangxi Normal University, Guilin, 541004, China
Yaozong Zheng
Yaozong Zheng
Guangxi Normal University
Visual TrackingMultimodal Tracking
Xiantao Hu
Xiantao Hu
Nanjing University of Science & Technology
Computer VIsion
Y
Yuanliang Xue
Xi’an Research Institute of High Technology, Xi’an 710025, China
S
Shuxiang Song
Key Lab of Education Blockchain and Intelligent Technology, Ministry of Education, Guangxi Normal University, Guilin, 541004, China; Guangxi Key Lab of Multi-Source Information Mining and Security, Guangxi Normal University, Guilin, 541004, China