Many Ways to Be Fake: Benchmarking Fake News Detection Under Strategy-Driven AI Generation

📅 2026-04-10
📈 Citations: 0
Influential: 0
📄 PDF

career value

204K/year
🤖 AI Summary
This study addresses the challenge that existing fake news detection methods struggle to identify hybrid disinformation—strategically crafted falsehoods embedded within otherwise truthful content through human–AI collaboration. To bridge this gap, the authors introduce MANYFAKE, a novel benchmark that systematically simulates human–AI collaborative fabrication strategies. Leveraging multi-strategy prompt engineering and large language models, they synthesize 6,798 realistic news articles blending verifiable facts with subtle, optimized fabrications. This dataset fills a critical void in current resources by capturing strategic misinformation scenarios absent from prior benchmarks. Experimental results demonstrate that while state-of-the-art detectors perform well on entirely fabricated news, they remain significantly vulnerable to nuanced, contextually integrated falsehoods characteristic of human–AI hybrid generation.

Technology Category

Application Category

📝 Abstract
Recent advances in large language models (LLMs) have enabled the large-scale generation of highly fluent and deceptive news-like content. While prior work has often treated fake news detection as a binary classification problem, modern fake news increasingly arises through human-AI collaboration, where strategic inaccuracies are embedded within otherwise accurate and credible narratives. These mixed-truth cases represent a realistic and consequential threat, yet they remain underrepresented in existing benchmarks. To address this gap, we introduce MANYFAKE, a synthetic benchmark containing 6,798 fake news articles generated through multiple strategy-driven prompting pipelines that capture many ways fake news can be constructed and refined. Using this benchmark, we evaluate a range of state-of-the-art fake news detectors. Our results show that even advanced reasoning-enabled models approach saturation on fully fabricated stories, but remain brittle when falsehoods are subtle, optimized, and interwoven with accurate information.
Problem

Research questions and friction points this paper is trying to address.

fake news detection
strategy-driven generation
mixed-truth narratives
AI-generated disinformation
benchmarking
Innovation

Methods, ideas, or system contributions that make the work stand out.

strategy-driven generation
mixed-truth fake news
synthetic benchmark
LLM-based disinformation
fake news detection
🔎 Similar Papers
No similar papers found.