Data Augmentation via Latent Diffusion Models for Detecting Smell-Related Objects in Historical Artworks

📅 2025-09-18
📈 Citations: 0
✨ Influential: 0
📄 PDF
🤖 AI Summary
Detecting odor-related objects in historical artworks faces severe challenges due to sparse annotations and extreme class imbalance—stemming from stylistic diversity and the need for fine-grained categorization. Method: We propose a synthetic data augmentation framework based on Latent Diffusion Models (LDMs). By fine-tuning a pre-trained LDM, we generate high-fidelity, semantically controllable images of odor-related objects. We further introduce multi-strategy diffusion-based augmentation—including conditional guidance, localized inpainting, and style alignment—to effectively enrich underrepresented classes. Contribution/Results: Joint training on real and synthetic data significantly improves detection performance in low-resource settings, achieving substantial gains in mean Average Precision (mAP). Our approach demonstrates strong few-shot generalization and scalability for long-tailed domains such as cultural heritage analysis, where annotated data is scarce and class distributions are highly skewed.

Technology Category

Application Category

📝 Abstract
Finding smell references in historic artworks is a challenging problem. Beyond artwork-specific challenges such as stylistic variations, their recognition demands exceptionally detailed annotation classes, resulting in annotation sparsity and extreme class imbalance. In this work, we explore the potential of synthetic data generation to alleviate these issues and enable accurate detection of smell-related objects. We evaluate several diffusion-based augmentation strategies and demonstrate that incorporating synthetic data into model training can improve detection performance. Our findings suggest that leveraging the large-scale pretraining of diffusion models offers a promising approach for improving detection accuracy, particularly in niche applications where annotations are scarce and costly to obtain. Furthermore, the proposed approach proves to be effective even with relatively small amounts of data, and scaling it up provides high potential for further enhancements.
Problem

Research questions and friction points this paper is trying to address.

Detecting smell-related objects in historical artworks
Addressing annotation sparsity and class imbalance
Using synthetic data generation to improve detection accuracy
Innovation

Methods, ideas, or system contributions that make the work stand out.

Latent diffusion models generate synthetic data
Augmentation strategies address annotation sparsity
Pretrained diffusion models improve detection accuracy
🔎 Similar Papers
No similar papers found.
A
Ahmed Sheta
Pattern Recognition Lab, Friedrich-Alexander Universität Erlangen-Nßrnberg, Germany
Mathias Zinnen
Mathias Zinnen
Pattern Recognition Lab, Friedrich-Alexander-Universität
Machine LearningComputer VisionObject DetectionDigital HumanitiesSelf-Supervised Learning
A
Aline Sindel
Pattern Recognition Lab, Friedrich-Alexander Universität Erlangen-Nßrnberg, Germany
A
Andreas Maier
Pattern Recognition Lab, Friedrich-Alexander Universität Erlangen-Nßrnberg, Germany
Vincent Christlein
Vincent Christlein
University Erlangen-Nuremberg
Computer VisionDocument AnalysisArt AnalysisComputational HumanitiesAI4Conservation