π€ AI Summary
Social media compression and re-encoding severely degrade AI-generated image detection performance, yet existing benchmarks lack realistic propagation distortions. Method: We introduce TrueFake, a large-scale benchmark of 600K images encompassing synthetics from state-of-the-art generators (e.g., SDXL, DALLΒ·E 3, StyleGAN3) and their post-propagation variants distorted by Instagram, X, and Facebook pipelines. We systematically model forensic trace degradation along social dissemination chains and propose an βin-the-wildβ evaluation paradigm. Using multi-scale feature analysis and noise-aware robust training, we quantify detector performance drops under real-world conditions. Contribution/Results: We reveal that mainstream detectors suffer 32β57% average accuracy decline after social propagation. We further demonstrate that contrastive learning and noise-aware fine-tuning significantly improve cross-platform generalization. TrueFake establishes the first reproducible, operationally realistic benchmark and optimization framework for industrial-grade AI-generated image detection.
π Abstract
AI-generated synthetic media are increasingly used in real-world scenarios, often with the purpose of spreading misinformation and propaganda through social media platforms, where compression and other processing can degrade fake detection cues. Currently, many forensic tools fail to account for these in-the-wild challenges. In this work, we introduce TrueFake, a large-scale benchmarking dataset of 600,000 images including top notch generative techniques and sharing via three different social networks. This dataset allows for rigorous evaluation of state-of-the-art fake image detectors under very realistic and challenging conditions. Through extensive experimentation, we analyze how social media sharing impacts detection performance, and identify current most effective detection and training strategies. Our findings highlight the need for evaluating forensic models in conditions that mirror real-world use.