OD3: Optimization-free Dataset Distillation for Object Detection

πŸ“… 2025-06-02
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
To address the lack of efficient, optimization-free methods for dataset distillation in object detection, this paper proposes OD3β€”the first optimization-free data distillation framework specifically designed for detection tasks. OD3 employs a two-stage strategy: (i) spatially aware, instance-level synthesis via iterative candidate instance localization, and (ii) confidence-driven instance selection using a frozen pre-trained observer modelβ€”both stages requiring no gradient-based optimization. The result is a compact, high-informativeness detection dataset. On MS COCO and PASCAL VOC, OD3 achieves competitive or superior detection performance using only 0.25%–5% of the original training data. Notably, at a 1.0% compression ratio on COCO, OD3 improves mAPβ‚…β‚€ by over 14 percentage points compared to the full dataset baseline, significantly outperforming existing detection distillation and core-set methods and establishing new state-of-the-art performance in detection-specific data distillation.

Technology Category

Application Category

πŸ“ Abstract
Training large neural networks on large-scale datasets requires substantial computational resources, particularly for dense prediction tasks such as object detection. Although dataset distillation (DD) has been proposed to alleviate these demands by synthesizing compact datasets from larger ones, most existing work focuses solely on image classification, leaving the more complex detection setting largely unexplored. In this paper, we introduce OD3, a novel optimization-free data distillation framework specifically designed for object detection. Our approach involves two stages: first, a candidate selection process in which object instances are iteratively placed in synthesized images based on their suitable locations, and second, a candidate screening process using a pre-trained observer model to remove low-confidence objects. We perform our data synthesis framework on MS COCO and PASCAL VOC, two popular detection datasets, with compression ratios ranging from 0.25% to 5%. Compared to the prior solely existing dataset distillation method on detection and conventional core set selection methods, OD3 delivers superior accuracy, establishes new state-of-the-art results, surpassing prior best method by more than 14% on COCO mAP50 at a compression ratio of 1.0%. Code and condensed datasets are available at: https://github.com/VILA-Lab/OD3.
Problem

Research questions and friction points this paper is trying to address.

Optimization-free dataset distillation for object detection
Reducing computational demands in dense prediction tasks
Synthesizing compact datasets for complex detection settings
Innovation

Methods, ideas, or system contributions that make the work stand out.

Optimization-free dataset distillation for detection
Two-stage candidate selection and screening
Pre-trained observer model filters low-confidence objects
πŸ”Ž Similar Papers
No similar papers found.
Salwa K. Al Khatib
Salwa K. Al Khatib
PhD Student @ MBZUAI
Machine LearningComputer Vision
A
Ahmed ElHagry
Mohamed Bin Zayed University of Artificial Intelligence (MBZUAI)
Shitong Shao
Shitong Shao
The Hong Kong University of Science and Technology (Guangzhou)
Efficient Computer VisionDiffusion ModelsInference-heavy Algorithms
Z
Zhiqiang Shen
Mohamed Bin Zayed University of Artificial Intelligence (MBZUAI)