Refine-and-Contrast: Adaptive Instance-Aware BEV Representations for Multi-UAV Collaborative Object Detection

šŸ“… 2025-08-18
šŸ“ˆ Citations: 0
✨ Influential: 0
šŸ“„ PDF
šŸ¤– AI Summary
To address resource constraints, severe occlusion, and limited wide-area coverage in multi-UAV cooperative 3D detection, this paper proposes AdaBEV—a novel framework for efficient and robust bird’s-eye-view (BEV) representation learning. First, a box-guided refinement module adaptively focuses on foreground instance regions. Second, an instance-background contrastive learning mechanism is introduced to enforce discriminative feature separation directly in BEV space. Third, lightweight BEV optimization—integrated with 2D supervision and spatial subdivision—is employed to generate instance-aware BEV representations from low-resolution inputs. By departing from conventional uniform-grid BEV modeling, AdaBEV significantly enhances occlusion robustness and large-scale scene perception. On the Air-Co-Pred benchmark, AdaBEV achieves state-of-the-art accuracy with substantially lower computational overhead, approaching the performance upper bound of high-resolution methods.

Technology Category

Application Category

šŸ“ Abstract
Multi-UAV collaborative 3D detection enables accurate and robust perception by fusing multi-view observations from aerial platforms, offering significant advantages in coverage and occlusion handling, while posing new challenges for computation on resource-constrained UAV platforms. In this paper, we present AdaBEV, a novel framework that learns adaptive instance-aware BEV representations through a refine-and-contrast paradigm. Unlike existing methods that treat all BEV grids equally, AdaBEV introduces a Box-Guided Refinement Module (BG-RM) and an Instance-Background Contrastive Learning (IBCL) to enhance semantic awareness and feature discriminability. BG-RM refines only BEV grids associated with foreground instances using 2D supervision and spatial subdivision, while IBCL promotes stronger separation between foreground and background features via contrastive learning in BEV space. Extensive experiments on the Air-Co-Pred dataset demonstrate that AdaBEV achieves superior accuracy-computation trade-offs across model scales, outperforming other state-of-the-art methods at low resolutions and approaching upper bound performance while maintaining low-resolution BEV inputs and negligible overhead.
Problem

Research questions and friction points this paper is trying to address.

Enhances multi-UAV collaborative 3D detection accuracy
Reduces computation for resource-constrained UAV platforms
Improves foreground-background feature separation in BEV space
Innovation

Methods, ideas, or system contributions that make the work stand out.

Adaptive instance-aware BEV representations
Box-Guided Refinement Module (BG-RM)
Instance-Background Contrastive Learning (IBCL)
šŸ”Ž Similar Papers
No similar papers found.
Z
Zhongyao Li
Key Laboratory of Target Cognition and Application Technology, Aerospace Information Research Institute, Chinese Academy of Sciences; University of Chinese Academy of Sciences
P
Peirui Cheng
Key Laboratory of Target Cognition and Application Technology, Aerospace Information Research Institute, Chinese Academy of Sciences
L
Liangjin Zhao
Key Laboratory of Target Cognition and Application Technology, Aerospace Information Research Institute, Chinese Academy of Sciences
C
Chen Chen
Key Laboratory of Target Cognition and Application Technology, Aerospace Information Research Institute, Chinese Academy of Sciences; University of Chinese Academy of Sciences
Y
Yundu Li
Key Laboratory of Target Cognition and Application Technology, Aerospace Information Research Institute, Chinese Academy of Sciences; University of Chinese Academy of Sciences
Z
Zhechao Wang
Key Laboratory of Target Cognition and Application Technology, Aerospace Information Research Institute, Chinese Academy of Sciences; University of Chinese Academy of Sciences
X
Xue Yang
Shanghai Jiao Tong University
Xian Sun
Xian Sun
AerospaceĀ InformationĀ ResearchĀ Institute,Ā ChineseĀ AcademyĀ ofĀ Sciences
Remote SensingComputer Vision and Pattern RecognitionArtificial Intelligence
Zhirui Wang
Zhirui Wang
Aerospace Information Research Institute, Chinese Academy of Sciences
Remote sensing image interpretationtarget detectiontarget recognition