CollabOD: Collaborative Multi-Backbone with Cross-scale Vision for UAV Small Object Detection

📅 2026-03-06
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenges of unstable detection and poor robustness of small objects in UAV aerial imagery, which arise from scale variation, detail degradation, and stringent computational constraints. To tackle these issues, the authors propose a lightweight collaborative detection framework that explicitly enhances fine-grained features prior to multi-scale fusion through a multi-backbone collaboration mechanism, cross-scale feature alignment, and a structure-preserving detail retention strategy. A unified localization-aware detection head is further introduced to improve spatial precision. The model architecture is co-optimized from three perspectives—image processing, channel design, and model lightweighting—achieving a balance between efficient inference and detailed perception. Without increasing deployment overhead, the proposed method significantly enhances the localization stability and detection robustness of small targets.

Technology Category

Application Category

📝 Abstract
Small object detection in unmanned aerial vehicle (UAV) imagery is challenging, mainly due to scale variation, structural detail degradation, and limited computational resources. In high-altitude scenarios, fine-grained features are further weakened during hierarchical downsampling and cross-scale fusion, resulting in unstable localization and reduced robustness. To address this issue, we propose CollabOD, a lightweight collaborative detection framework that explicitly preserves structural details and aligns heterogeneous feature streams before multi-scale fusion. The framework integrates Structural Detail Preservation, Cross-Path Feature Alignment, and Localization-Aware Lightweight Design strategies. From the perspectives of image processing, channel structure, and lightweight design, it optimizes the architecture of conventional UAV perception models. The proposed design enhances representation stability while maintaining efficient inference. A unified detail-aware detection head further improves regression robustness without introducing additional deployment overhead. The code is available at: https://github.com/Bai-Xuecheng/CollabOD.
Problem

Research questions and friction points this paper is trying to address.

small object detection
UAV imagery
scale variation
structural detail degradation
computational constraints
Innovation

Methods, ideas, or system contributions that make the work stand out.

Structural Detail Preservation
Cross-Path Feature Alignment
Localization-Aware Lightweight Design
Multi-Scale Fusion
Small Object Detection
🔎 Similar Papers
No similar papers found.
Xuecheng Bai
Xuecheng Bai
Shenyang Ligong University
Object DetectionLow-light Image Enchance
Y
Yuxiang Wang
The University of Sydney, NSW, Australia
Chuanzhi Xu
Chuanzhi Xu
Student, The University of Sydney
Neuromorphic VisionHigh-level VisionComputational Aesthetics
B
Boyu Hu
The University of International Business and Economics, Beijing, China
K
Kang Han
Aviation Traffic Control Technology (SHENZHEN) Co., Ltd., Shenzhen, China; Research Institute of Traffic Control Technology Co., Ltd., Beijing, China
R
Ruijie Pan
Aviation Traffic Control Technology (SHENZHEN) Co., Ltd., Shenzhen, China
X
Xiaowei Niu
Guoneng Shuohuang Railway Development Co., Ltd., Hebei, China
X
Xiaotian Guan
Guoneng Shuohuang Railway Development Co., Ltd., Hebei, China
L
Liqiang Fu
Guoneng Shuohuang Railway Development Co., Ltd., Hebei, China
P
Pengfei Ye
The Hong Kong University of Science and Technology, Hong Kong, China