🤖 AI Summary
This work addresses the challenges of unstable detection and poor robustness of small objects in UAV aerial imagery, which arise from scale variation, detail degradation, and stringent computational constraints. To tackle these issues, the authors propose a lightweight collaborative detection framework that explicitly enhances fine-grained features prior to multi-scale fusion through a multi-backbone collaboration mechanism, cross-scale feature alignment, and a structure-preserving detail retention strategy. A unified localization-aware detection head is further introduced to improve spatial precision. The model architecture is co-optimized from three perspectives—image processing, channel design, and model lightweighting—achieving a balance between efficient inference and detailed perception. Without increasing deployment overhead, the proposed method significantly enhances the localization stability and detection robustness of small targets.
📝 Abstract
Small object detection in unmanned aerial vehicle (UAV) imagery is challenging, mainly due to scale variation, structural detail degradation, and limited computational resources. In high-altitude scenarios, fine-grained features are further weakened during hierarchical downsampling and cross-scale fusion, resulting in unstable localization and reduced robustness. To address this issue, we propose CollabOD, a lightweight collaborative detection framework that explicitly preserves structural details and aligns heterogeneous feature streams before multi-scale fusion. The framework integrates Structural Detail Preservation, Cross-Path Feature Alignment, and Localization-Aware Lightweight Design strategies. From the perspectives of image processing, channel structure, and lightweight design, it optimizes the architecture of conventional UAV perception models. The proposed design enhances representation stability while maintaining efficient inference. A unified detail-aware detection head further improves regression robustness without introducing additional deployment overhead. The code is available at: https://github.com/Bai-Xuecheng/CollabOD.