Findings of the Counter Turing Test: AI-Generated Text Detection

📅 2026-05-20

🤖 AI Summary
This study addresses the growing challenge of distinguishing text generated by large language models from human-written content and attributing it to specific models, a task critical for mitigating disinformation and security risks. Drawing on the Counter Turing Test (CT2) shared task, the work presents a systematic evaluation of diverse detection approaches across both binary classification (human vs. AI) and fine-grained model attribution. The top-performing systems combined fine-tuned Transformers (e.g., DeBERTa, BART), ensemble methods, and hybrid strategies, reaching a perfect F1 score of 1.0000 in binary detection and a best F1 of 0.9531 in model attribution. The gap between the two tasks highlights the limitations of existing techniques in fine-grained source attribution and underscores the need for further research in this emerging domain.
📝 Abstract
The rapid proliferation of AI-generated text has introduced significant challenges in maintaining the integrity of digital content. Advanced generative models such as GPT-4, Claude 3.5, and Llama can produce highly coherent and human-like text, making it increasingly difficult to differentiate between human-written and AI-generated content. While these models have transformative applications, their misuse has raised concerns about misinformation, biased narratives, and security threats. This paper provides a comprehensive analysis of state-of-the-art AI-generated text detection techniques and evaluates their effectiveness through the Counter Turing Test (CT2) shared tasks. Task A (Binary Classification) required participants to distinguish between human-written and AI-generated text, while Task B (Model Attribution) focused on identifying the specific language model responsible for generating a given text. The results demonstrated high performance in binary classification, with the top system achieving an F1 score of 1.0000, but significantly lower scores in model attribution, where the best system achieved 0.9531, highlighting the increased complexity of this task. The top-performing teams leveraged fine-tuned transformer models, ensemble learning, and hybrid detection approaches, with DeBERTa-based and BART-based methods demonstrating strong results. However, the lower scores in Task B underscore the challenges of distinguishing outputs from different LLMs, necessitating further research into adversarial robustness, feature extraction, and cross-domain generalization.
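Both tasks are scored with F1, and the abstract does not say which averaging scheme is used for the multi-class attribution task. As a minimal sketch, assuming macro-averaged F1 and using hypothetical labels and toy predictions (none of them taken from the shared-task data), the metric can be computed as follows:

```python
from collections import Counter


def f1_per_class(y_true, y_pred):
    """Return {label: F1} computed from parallel lists of true and predicted labels."""
    labels = set(y_true) | set(y_pred)
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted label gains a false positive
            fn[truth] += 1  # true label gains a false negative
    scores = {}
    for label in labels:
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores[label] = 2 * tp[label] / denom if denom else 0.0
    return scores


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 (assumed averaging; not confirmed by the source)."""
    scores = f1_per_class(y_true, y_pred)
    return sum(scores.values()) / len(scores)


# Task A style (binary): hypothetical labels, every prediction correct.
task_a_true = ["human", "ai", "ai", "human"]
task_a_pred = ["human", "ai", "ai", "human"]

# Task B style (attribution): hypothetical model labels, one "claude" text
# misattributed to "llama".
task_b_true = ["gpt", "gpt", "claude", "claude", "llama", "llama"]
task_b_pred = ["gpt", "gpt", "claude", "llama", "llama", "llama"]

print(f"Task A macro-F1: {macro_f1(task_a_true, task_a_pred):.4f}")
print(f"Task B macro-F1: {macro_f1(task_b_true, task_b_pred):.4f}")
```

In this toy setup a single confusion between two generators lowers the F1 of both affected classes, which illustrates why attribution scores tend to trail binary detection scores.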
Problem

Research questions and friction points this paper is trying to address.

AI-generated text detection
Counter Turing Test
model attribution
text authenticity
large language models
Innovation

Methods, ideas, or system contributions that make the work stand out.

Counter Turing Test
AI-generated text detection
model attribution
DeBERTa
ensemble learning
🔎 Similar Papers
2024-06-21, Journal of Artificial Intelligence Research, Citations: 6
Rajarshi Roy
Kalyani Government Engineering College, India
Gurpreet Singh
IIIT Guwahati, India
Ashhar Aziz
IIIT Delhi, India
Shashwat Bajpai
BITS Pilani Hyderabad Campus, India
Nasrin Imanpour
PhD, Computer Science and Engineering
Artificial Intelligence, Machine Learning, Computer Vision
Shwetangshu Biswas
National Institute of Technology Silchar, India
Kapil Wanaskar
San José State University, USA
Parth Patwa
Amazon
Machine Learning, Deep Learning, Natural Language Processing, Computational Linguistics, Computer
Subhankar Ghosh
Indian Institute of Technology
Computer Vision, Machine Learning, Artificial Intelligence
Shreyas Dixit
Vishwakarma Institute of Information Technology, India
Nilesh Ranjan Pal
Kalyani Government Engineering College, India
Vipula Rawte
AI Institute of University of South Carolina
Text Mining, Natural Language Processing, Deep Learning, Semantic Web, Ontology
Ritvik Garimella
PhD @ UofSC
NeuroSymbolic AI, Multimodal Learning, Deep Learning, NLP
Amitava Das
BITS Pilani, Goa
Amit Sheth
NCR Chair & Prof.; Founding Director, AI Institute; U. of South Carolina
Neurosymbolic AI, Knowledge Graph, Knowledge-infused Learning, Semantic Web, Artificial Intelligence
Vasu Sharma
Facebook AI Research (FAIR)
Generative AI, LLMs, Computer Vision, Natural Language Processing, Multimodal ML
Aishwarya Naresh Reganti
Amazon
Artificial Social Intelligence, Multimodal ML, Graph Neural Networks, Natural Language Processing
Vinija Jain
Meta | Ex: Amazon, Oracle, Palo Alto Networks
AI, Natural Language Processing, Multimodal AI, Recommender Systems, Information Retrieval
Aman Chadha
GenAI Leadership @ Apple • Stanford AI • UW-Madison ECE • Ex: Apple, AWS, Alexa, Nvidia
Multimodal AI, Natural Language Processing, Computer Vision, Speech Processing, Recommender Systems