π€ AI Summary
The proliferation of generative AI images poses escalating challenges for forgery detection, as existing methods suffer from poor generalization and vulnerability to post-processing distortions. Method: We propose Co-Spy, the first framework that jointly models semantic anomalies (e.g., anatomical inconsistencies) and pixel-level artifacts via dual-path reasoning. It incorporates multi-scale feature enhancement, attention-guided adaptive fusion, contrastive pre-training, and a lightweight classification head for robust detection. Contribution/Results: We introduce Co-Spy-Benchβthe first comprehensive benchmark covering five real-world data categories, 22 state-of-the-art generative models, and 50,000 in-the-wild web images. Under unified training, Co-Spy achieves 11β34% higher average accuracy than prior art, significantly improving cross-model generalization and robustness against JPEG compression and other common post-processing operations. Our code is publicly available.
π Abstract
With the rapid advancement of generative AI, it is now possible to synthesize high-quality images in a few seconds. Despite the power of these technologies, they raise significant concerns regarding misuse. Current efforts to distinguish between real and AI-generated images may lack generalization, being effective for only certain types of generative models and susceptible to post-processing techniques like JPEG compression. To overcome these limitations, we propose a novel framework, Co-Spy, that first enhances existing semantic features (e.g., the number of fingers in a hand) and artifact features (e.g., pixel value differences), and then adaptively integrates them to achieve more general and robust synthetic image detection. Additionally, we create Co-Spy-Bench, a comprehensive dataset comprising 5 real image datasets and 22 state-of-the-art generative models, including the latest models like FLUX. We also collect 50k synthetic images in the wild from the Internet to enable evaluation in a more practical setting. Our extensive evaluations demonstrate that our detector outperforms existing methods under identical training conditions, achieving an average accuracy improvement of approximately 11% to 34%. The code is available at https://github.com/Megum1/Co-Spy.