Many Ways to Be Fake: Benchmarking Fake News Detection Under Strategy-Driven AI Generation

📅 2026-04-10

📈 Citations: 0

✨ Influential: 0

career value

184K/year

🤖 AI Summary

This study addresses the challenge that existing fake news detection methods struggle to identify hybrid disinformation—strategically crafted falsehoods embedded within otherwise truthful content through human–AI collaboration. To bridge this gap, the authors introduce MANYFAKE, a novel benchmark that systematically simulates human–AI collaborative fabrication strategies. Leveraging multi-strategy prompt engineering and large language models, they synthesize 6,798 realistic news articles blending verifiable facts with subtle, optimized fabrications. This dataset fills a critical void in current resources by capturing strategic misinformation scenarios absent from prior benchmarks. Experimental results demonstrate that while state-of-the-art detectors perform well on entirely fabricated news, they remain significantly vulnerable to nuanced, contextually integrated falsehoods characteristic of human–AI hybrid generation.

Technology Category

Application Category

📝 Abstract

Recent advances in large language models (LLMs) have enabled the large-scale generation of highly fluent and deceptive news-like content. While prior work has often treated fake news detection as a binary classification problem, modern fake news increasingly arises through human-AI collaboration, where strategic inaccuracies are embedded within otherwise accurate and credible narratives. These mixed-truth cases represent a realistic and consequential threat, yet they remain underrepresented in existing benchmarks. To address this gap, we introduce MANYFAKE, a synthetic benchmark containing 6,798 fake news articles generated through multiple strategy-driven prompting pipelines that capture many ways fake news can be constructed and refined. Using this benchmark, we evaluate a range of state-of-the-art fake news detectors. Our results show that even advanced reasoning-enabled models approach saturation on fully fabricated stories, but remain brittle when falsehoods are subtle, optimized, and interwoven with accurate information.

Problem

Research questions and friction points this paper is trying to address.

fake news detection

strategy-driven generation

mixed-truth narratives

AI-generated disinformation

benchmarking

Innovation

Methods, ideas, or system contributions that make the work stand out.

strategy-driven generation

mixed-truth fake news

synthetic benchmark