🤖 AI Summary
To address inefficiency and high computational overhead in small-object detection—caused by redundant feature utilization and rigid query assignment—this paper proposes Dome-DETR, a density-guided feature-query coordination mechanism. Specifically, the authors design a Density-Focal Extractor (DeFE) for foreground-sensitive feature enhancement; introduce Masked Window Attention Sparsification (MWAS) to reduce Transformer complexity; and propose Progressive Adaptive Query Initialization (PAQI), which dynamically generates high-quality queries based on spatial target density. Integrated into a lightweight DETR architecture, the approach jointly leverages foreground clustering masks, sparse attention, and spatially adaptive density modulation. Experiments on AI-TOD-V2 and VisDrone demonstrate AP improvements of +3.3 and +2.5, respectively, achieving state-of-the-art accuracy while maintaining low FLOPs and a compact model size (<30M parameters).
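The query-allocation idea behind PAQI can be illustrated with a minimal sketch: spend the fixed query budget in proportion to a predicted object-density map, so crowded regions receive more decoder queries than empty ones. The function name, the cell-grid formulation, and the largest-remainder rounding are illustrative assumptions, not the paper's actual progressive, learned scheme.

```python
import numpy as np

def density_adaptive_queries(density, total_queries=300):
    """Hypothetical sketch of density-proportional query allocation.

    density: (H, W) non-negative density scores, one per spatial cell.
    Returns an (H, W) integer map of per-cell query counts that sums
    exactly to total_queries (largest-remainder rounding).
    """
    d = density / density.sum()
    raw = d * total_queries
    counts = np.floor(raw).astype(int)
    # hand out the leftover queries to the cells with the largest
    # fractional parts so the budget is spent exactly
    remainder = total_queries - counts.sum()
    frac = (raw - counts).ravel()
    flat = counts.ravel()  # view into counts
    for idx in np.argsort(frac)[::-1][:remainder]:
        flat[idx] += 1
    return counts
```

With `density = [[4, 1], [1, 2]]` and a budget of 8, the densest cell receives half the queries; a uniform density map degenerates to the usual uniform query initialization.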
📝 Abstract
Tiny object detection plays a vital role in drone surveillance, remote sensing, and autonomous systems, enabling the identification of small targets across vast landscapes. However, existing methods suffer from inefficient feature utilization and high computational costs due to redundant feature processing and rigid query allocation. To address these challenges, we propose Dome-DETR, a novel framework with Density-Oriented Feature-Query Manipulation for Efficient Tiny Object Detection. To reduce feature redundancy, we introduce a lightweight Density-Focal Extractor (DeFE) that produces clustered, compact foreground masks. Leveraging these masks, we incorporate Masked Window Attention Sparsification (MWAS) to focus computational resources on the most informative regions via sparse attention. In addition, we propose Progressive Adaptive Query Initialization (PAQI), which adaptively modulates query density across spatial areas for better query allocation. Extensive experiments demonstrate that Dome-DETR achieves state-of-the-art performance (+3.3 AP on AI-TOD-V2 and +2.5 AP on VisDrone) while maintaining low computational complexity and a compact model size. Code will be released upon acceptance.
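The sparsification idea behind MWAS can be sketched as follows: partition the feature map into non-overlapping windows and run self-attention only inside windows that the foreground mask marks as containing objects, passing background windows through unchanged. This is a minimal NumPy sketch under assumed names and shapes — single-head attention, a hard skip rule, and no learned projections — not the paper's actual implementation.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def masked_window_attention(feat, fg_mask, win=4):
    """Sketch of mask-guided sparse window attention.

    feat:    (H, W, C) feature map
    fg_mask: (H, W) boolean foreground mask (e.g. from a density extractor)
    Attention is computed only inside windows that overlap foreground;
    pure-background windows are copied through, saving their O(win^4)
    attention cost. Returns the output map and the number of windows
    actually attended.
    """
    H, W, C = feat.shape
    out = feat.copy()
    n_attended = 0
    for i in range(0, H, win):
        for j in range(0, W, win):
            m = fg_mask[i:i + win, j:j + win]
            if not m.any():          # skip pure-background windows
                continue
            n_attended += 1
            x = feat[i:i + win, j:j + win].reshape(-1, C)  # window tokens
            attn = softmax(x @ x.T / np.sqrt(C))           # scaled dot-product
            out[i:i + win, j:j + win] = (attn @ x).reshape(
                m.shape[0], m.shape[1], C)
    return out, n_attended
```

For tiny-object scenes, where foreground typically covers a small fraction of the image, most windows take the skip branch, which is what makes the attention cost scale with foreground area rather than image area.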