Scene-Aware Location Modeling for Data Augmentation in Automotive Object Detection

📅 2025-04-23
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing generative data augmentation methods for autonomous driving suffer from spatial layout distortion of inserted objects, primarily due to neglecting physical plausibility and semantic constraints—resulting in unrealistic object placements. Method: This paper proposes a scene-aware probabilistic placement modeling framework that jointly encodes geometric structure, semantic relationships, and contextual cues by integrating multimodal scene understanding, probabilistic graphical models, and generative image inpainting. A learnable spatial prior is further introduced to predict physically feasible and semantically consistent placement regions aligned with real-world driving scenes. Contribution/Results: Evaluated on two autonomous driving object detection benchmarks, our method establishes new state-of-the-art performance for generative augmentation, achieving a +1.4 mAP improvement over the previous best approach (a 2.8× gain relative to the runner-up). It also yields significant gains in instance segmentation accuracy.

Technology Category

Application Category

📝 Abstract
Generative image models are increasingly being used for training data augmentation in vision tasks. In the context of automotive object detection, methods usually focus on producing augmented frames that look as realistic as possible, for example by replacing real objects with generated ones. Others try to maximize the diversity of augmented frames, for example by pasting lots of generated objects onto existing backgrounds. Both perspectives pay little attention to the locations of objects in the scene. Frame layouts are either reused with little or no modification, or they are random and disregard realism entirely. In this work, we argue that optimal data augmentation should also include realistic augmentation of layouts. We introduce a scene-aware probabilistic location model that predicts where new objects can realistically be placed in an existing scene. By then inpainting objects in these locations with a generative model, we obtain much stronger augmentation performance than existing approaches. We set a new state of the art for generative data augmentation on two automotive object detection tasks, achieving up to $2.8 imes$ higher gains than the best competing approach ($+1.4$ vs. $+0.5$ mAP boost). We also demonstrate significant improvements for instance segmentation.
Problem

Research questions and friction points this paper is trying to address.

Enhancing realism in automotive object detection data augmentation
Improving scene-aware object placement for augmented frames
Maximizing performance gains in generative data augmentation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Scene-aware probabilistic location model predicts realistic object placement
Generative inpainting enhances augmented object realism in scenes
Achieves state-of-the-art performance in automotive object detection
🔎 Similar Papers
No similar papers found.
J
Jens Petersen
Qualcomm AI Research
Davide Abati
Davide Abati
Qualcomm AI Research
Computer VisionMachine Learning
A
A. Habibian
Qualcomm AI Research
A
A. Wiggers
Qualcomm AI Research