Domain Randomization for Object Detection in Manufacturing Applications using Synthetic Data: A Comprehensive Study

📅 2025-06-09
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the challenge of synthetic data failing to approximate real-world distributions in manufacturing object detection, this paper proposes an end-to-end synthetic data generation pipeline covering object attributes, background, illumination, camera parameters, and post-processing, and introduces the SIP15-OD industrial component detection dataset. We systematically quantify, for the first time, the impact of material properties, rendering techniques, post-processing, and distractors on sim-to-real transfer. A fine-grained, multi-factor joint domain randomization strategy tailored to manufacturing scenarios is introduced. Leveraging physics-aware rendering in Blender/Unity and YOLOv8-based training, our model achieves 96.4% mAP@50 on a public robotics benchmark and 94.1%, 99.5%, and 95.3% mAP@50 across three SIP15-OD operational conditions—performance comparable to models trained exclusively on real data.

Technology Category

Application Category

📝 Abstract
This paper addresses key aspects of domain randomization in generating synthetic data for manufacturing object detection applications. To this end, we present a comprehensive data generation pipeline that reflects different factors: object characteristics, background, illumination, camera settings, and post-processing. We also introduce the Synthetic Industrial Parts Object Detection dataset (SIP15-OD) consisting of 15 objects from three industrial use cases under varying environments as a test bed for the study, while also employing an industrial dataset publicly available for robotic applications. In our experiments, we present more abundant results and insights into the feasibility as well as challenges of sim-to-real object detection. In particular, we identified material properties, rendering methods, post-processing, and distractors as important factors. Our method, leveraging these, achieves top performance on the public dataset with Yolov8 models trained exclusively on synthetic data; mAP@50 scores of 96.4% for the robotics dataset, and 94.1%, 99.5%, and 95.3% across three of the SIP15-OD use cases, respectively. The results showcase the effectiveness of the proposed domain randomization, potentially covering the distribution close to real data for the applications.
Problem

Research questions and friction points this paper is trying to address.

Exploring domain randomization for synthetic data in manufacturing object detection
Developing a comprehensive pipeline for synthetic data generation with varied factors
Evaluating sim-to-real feasibility and challenges in industrial object detection
Innovation

Methods, ideas, or system contributions that make the work stand out.

Comprehensive synthetic data generation pipeline
Synthetic Industrial Parts Object Detection dataset
Domain randomization for sim-to-real detection
🔎 Similar Papers
No similar papers found.
X
Xiaomeng Zhu
Scania CV AB, Södertälje, Sweden
J
Jacob Henningsson
D
Duruo Li
P
Par Maartensson
L
Lars Hanson
Skövde University, Skövde, Sweden
M
Maarten Bjorkman
KTH Royal Institute of Technology, Stockholm, Sweden
Atsuto Maki
Atsuto Maki
Professor of Computer Science, KTH Royal Institute of Technology
computer visionmachine learning