Findings of the Counter Turing Test: AI-Generated Image Detection

📅 2026-05-20
📈 Citations: 0
Influential: 0
📄 PDF

career value

198K/year
🤖 AI Summary
This work addresses the growing challenges of misinformation and model attribution posed by increasingly photorealistic AI-generated images. To systematically evaluate the detectability and traceability of such content, the authors propose and organize the Counter Turing Test competition and introduce the MS COCOAI dataset. They employ a comprehensive suite of methods—including CNNs, Vision Transformers, frequency-domain analysis, contrastive learning, and multimodal approaches—for both image authenticity verification and source model identification. Experimental results demonstrate that while binary detection of real versus synthetic images achieves an F1-score exceeding 0.83, the accuracy of identifying the specific generative model peaks at only 0.4986. This stark performance gap reveals, for the first time, that fine-grained model attribution is substantially more difficult than general forgery detection, highlighting a critical bottleneck in current forensic capabilities.
📝 Abstract
The rapid advancements in generative AI technologies, such as Stable Diffusion, DALL-E, and Midjourney, have significantly transformed the creation of synthetic visual content. While these models enable innovation across industries, they also pose serious challenges, including misinformation, disinformation, and biased content generation. The increasing realism of AI-generated images makes their detection a pressing concern for researchers, policymakers, and industry stakeholders. In this paper, we present the findings of the Defactify 4.0 workshop, which introduced the Counter Turing Test (CT2) for AI-Generated Image Detection. The competition consisted of two key tasks: (1) binary classification of images as either AI-generated or real and (2) identification of the specific generative model responsible for an AI-generated image. To facilitate this, we developed the MS COCOAI dataset, consisting of 50,000 synthetic images from multiple generative models alongside real-world images from the MS COCO dataset. Participants employed diverse detection strategies, including convolutional neural networks (CNNs), Vision Transformers (ViTs), frequency-based analysis, contrastive learning, and multimodal techniques. The results demonstrated that while AI-generated images can be detected with high accuracy (F1-score > 0.83), identifying the exact model used remains significantly more challenging (highest F1-score: 0.4986). These findings highlight the need for improved model fingerprinting, adversarial robustness, and real-time detection mechanisms.
Problem

Research questions and friction points this paper is trying to address.

AI-generated image detection
Counter Turing Test
generative models
image authenticity
model fingerprinting
Innovation

Methods, ideas, or system contributions that make the work stand out.

Counter Turing Test
AI-generated image detection
model fingerprinting
MS COCOAI dataset
generative model identification
🔎 Similar Papers
No similar papers found.
R
Rajarshi Roy
Kalyani Government Engineering College, India
Nasrin Imanpour
Nasrin Imanpour
PhD, Computer Science and Engineering
Artificial IntelligenceMachine LearningComputer Vision
A
Ashhar Aziz
IIIT Delhi, India
S
Shashwat Bajpai
BITS Pilani Hyderabad Campus, India
G
Gurpreet Singh
IIIT Guwahati, India
S
Shwetangshu Biswas
National Institute of Technology Silchar, India
K
Kapil Wanaskar
San José State University, USA
Parth Patwa
Parth Patwa
Amazon
Machine LearningDeep LearningNatural Language ProcessingComputational LinguisticsComputer
Subhankar Ghosh
Subhankar Ghosh
Indian Institute of Technology
Computer VisionMachine LearningArtificial Intelligence
S
Shreyas Dixit
Vishwakarma Institute of Information Technology, India
N
Nilesh Ranjan Pal
Kalyani Government Engineering College, India
Vipula Rawte
Vipula Rawte
AI Institute of University of South Carolina
Text MiningNatural Language ProcessingDeep LearningSemantic WebOntology
Ritvik Garimella
Ritvik Garimella
PhD @ UofSC
NeuroSymbolic AIMultimodal LearningDeep LearningNLP
A
Amitava Das
BITS Pilani, Goa
Amit Sheth
Amit Sheth
NCR Chair & Prof.; Founding Director, AI Institute; U. of South Carolina
Neurosymbolic AIKnowledge GraphKnowledge-infused LearningSemantic WebArtificial Intelligence
Vasu Sharma
Vasu Sharma
Facebook AI Research (FAIR)
Generative AILLMsComputer VisionNatural Language ProcessingMultimodal ML
Aishwarya Naresh Reganti
Aishwarya Naresh Reganti
Amazon
Artificial Social IntelligenceMultimodal MLGraph Neural NetworksNatural Language Processing
Vinija Jain
Vinija Jain
Meta | Ex: Amazon, Oracle, Palo Alto Networks
AINatural Language ProcessingMultimodal AIRecommender SystemsInformation Retrieval
Aman Chadha
Aman Chadha
GenAI Leadership @ Apple • Stanford AI • UW-Madison ECE • Ex: Apple, AWS, Alexa, Nvidia
Multimodal AINatural Language ProcessingComputer VisionSpeech ProcessingRecommender Systems