DF2023: The Digital Forensics 2023 Dataset for Image Forgery Detection

📅 2025-03-28

📈 Citations: 1

✨ Influential: 1

🤖 AI Summary

Manipulated images spread through online social networks are a growing tool for swaying public opinion. To support research on detecting them, this paper releases DF2023, a training and validation dataset of one million images spanning four major forgery categories: splicing, copy-move, enhancement, and removal. By giving researchers a common dataset, DF2023 enables an objective comparison of network architectures and reduces the time and effort spent preparing data.

📝 Abstract

The deliberate manipulation of public opinion, especially through altered images, which are frequently disseminated through online social networks, poses a significant danger to society. To fight this issue on a technical level, we support the research community by releasing the Digital Forensics 2023 (DF2023) training and validation dataset, comprising one million images from four major forgery categories: splicing, copy-move, enhancement, and removal. This dataset enables an objective comparison of network architectures and can significantly reduce the time and effort of researchers preparing datasets.

Problem

Research questions and friction points this paper is trying to address.

Detect manipulated images in online social networks

Provide dataset for image forgery detection research

Compare network architectures for forgery detection

Innovation

Methods, ideas, or system contributions that make the work stand out.

DF2023 dataset for image forgery detection

Covers four major forgery categories

Enables objective comparison of architectures
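
One way to read "objective comparison of architectures" is that every model is scored on the same fixed validation split with the same metric. The sketch below illustrates that idea only. It assumes binary ground-truth masks (1 = manipulated pixel) and pixel-level F1 as the metric, neither of which is specified in this summary. The model names and the tiny 2x2 masks are hypothetical placeholders, not DF2023 data or results.

```python
from typing import Dict, List, Sequence, Tuple

# Assumption: a mask is a 2-D grid of 0/1 values, 1 = manipulated pixel.
Mask = Sequence[Sequence[int]]


def pixel_f1(pred: Mask, truth: Mask) -> float:
    """Pixel-level F1 between a predicted and a ground-truth binary mask."""
    tp = fp = fn = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                tp += 1
            elif p and not t:
                fp += 1
            elif t and not p:
                fn += 1
    if tp + fp + fn == 0:
        return 1.0  # both masks are empty, so they agree completely
    return 2 * tp / (2 * tp + fp + fn)


def mean_f1(preds: List[Mask], truths: List[Mask]) -> float:
    """Average pixel-level F1 over a validation split."""
    if not truths or len(preds) != len(truths):
        raise ValueError("need one prediction per non-empty ground-truth list")
    return sum(pixel_f1(p, t) for p, t in zip(preds, truths)) / len(truths)


def rank_models(
    predictions: Dict[str, List[Mask]], truths: List[Mask]
) -> List[Tuple[str, float]]:
    """Score every model on the same split and sort best-first."""
    scores = {name: mean_f1(preds, truths) for name, preds in predictions.items()}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical toy split: two 2x2 ground-truth masks.
truths = [[[0, 1], [0, 1]], [[1, 1], [0, 0]]]
# Hypothetical outputs from two placeholder models on that same split.
predictions = {
    "model_a": [[[0, 1], [0, 1]], [[1, 0], [0, 0]]],
    "model_b": [[[1, 1], [0, 0]], [[0, 0], [0, 0]]],
}

ranking = rank_models(predictions, truths)
for name, score in ranking:
    print(f"{name}: mean pixel F1 = {score:.3f}")
```

Because the split and the metric are held fixed, differences in the ranking can be attributed to the models rather than to how each team prepared its data.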

🔎 Similar Papers

Revisiting Tampered Scene Text Detection in the Era of Generative AI