BrightCookies at SemEval-2025 Task 9: Exploring Data Augmentation for Food Hazard Classification

📅 2025-04-29
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the fine-grained, two-level classification of hazards and products in food recall reports, with particular emphasis on low accuracy for minority classes. We propose a class-specific, word-level data augmentation strategy and systematically evaluate the impact of synonym replacement, random word swapping, and context-aware word insertion on both Transformer-based models (e.g., BERT) and traditional machine learning classifiers. To our knowledge, this is the first work to empirically demonstrate—within an interpretable food hazard classification setting—that context-aware word insertion significantly improves minority-hazard class accuracy (+6%, *p* < 0.05), with gains exhibiting class-specificity rather than universal improvement. Results indicate that augmentation strategies must be tailored per class rather than applied uniformly, and that BERT achieves statistically significant performance gains in fine-grained classification. This work establishes an interpretable, reproducible data augmentation paradigm for few-shot classification in the food safety domain.

Technology Category

Application Category

📝 Abstract
This paper presents our system developed for the SemEval-2025 Task 9: The Food Hazard Detection Challenge. The shared task's objective is to evaluate explainable classification systems for classifying hazards and products in two levels of granularity from food recall incident reports. In this work, we propose text augmentation techniques as a way to improve poor performance on minority classes and compare their effect for each category on various transformer and machine learning models. We explore three word-level data augmentation techniques, namely synonym replacement, random word swapping, and contextual word insertion. The results show that transformer models tend to have a better overall performance. None of the three augmentation techniques consistently improved overall performance for classifying hazards and products. We observed a statistically significant improvement (P<0.05) in the fine-grained categories when using the BERT model to compare the baseline with each augmented model. Compared to the baseline, the contextual words insertion augmentation improved the accuracy of predictions for the minority hazard classes by 6%. This suggests that targeted augmentation of minority classes can improve the performance of transformer models.
Problem

Research questions and friction points this paper is trying to address.

Improving food hazard classification performance on minority classes
Comparing data augmentation techniques for transformer and ML models
Evaluating explainable systems for granular food recall hazard detection
Innovation

Methods, ideas, or system contributions that make the work stand out.

Text augmentation for minority class improvement
Synonym replacement and word swapping
Contextual word insertion boosts accuracy
🔎 Similar Papers
No similar papers found.
F
Foteini Papadopoulou
Centre for Language Studies, Radboud University, The Netherlands
Osman Mutlu
Osman Mutlu
Wageningen University & Research
Explainable Artificial IntelligenceFederated LearningNatural Language Processing
N
Neris Ozen
Wageningen Food Safety Research, The Netherlands
B
Bas H. M. van der Velden
Wageningen Food Safety Research, The Netherlands
Iris Hendrickx
Iris Hendrickx
Center for Language Studies, Radboud University Nijmegen, The Netherlands
Computational linguisticslinguistics
A
Ali Hurriyetouglu
Wageningen Food Safety Research, The Netherlands