Beyond Artifacts: Real-Centric Envelope Modeling for Reliable AI-Generated Image Detection

📅 2025-12-23
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Current AI-based image detectors heavily rely on generator-specific artifacts, rendering them ineffective against cascaded degradation—induced by multi-round cross-platform sharing and post-processing—in real-world scenarios, thereby suffering from poor generalization. To address this, we propose a Real-image-centered Envelope Modeling (REM) paradigm that abandons reliance on volatile artifacts and instead learns a robust manifold boundary of authentic images in feature space. Methodologically, REM synthesizes near-authentic samples via self-reconstructive feature perturbation and jointly optimizes an envelope estimator with cross-domain consistency regularization to learn a compact, resilient envelope of the real-image distribution. Evaluated on eight standard benchmarks, REM achieves an average improvement of 7.5% in detection accuracy. Moreover, on our newly constructed RealChain benchmark—designed to simulate realistic cascaded degradation—it significantly outperforms existing methods, establishing a foundation for highly robust, real-world deployment of AI-generated image detection.

Technology Category

Application Category

📝 Abstract
The rapid progress of generative models has intensified the need for reliable and robust detection under real-world conditions. However, existing detectors often overfit to generator-specific artifacts and remain highly sensitive to real-world degradations. As generative architectures evolve and images undergo multi-round cross-platform sharing and post-processing (chain degradations), these artifact cues become obsolete and harder to detect. To address this, we propose Real-centric Envelope Modeling (REM), a new paradigm that shifts detection from learning generator artifacts to modeling the robust distribution of real images. REM introduces feature-level perturbations in self-reconstruction to generate near-real samples, and employs an envelope estimator with cross-domain consistency to learn a boundary enclosing the real image manifold. We further build RealChain, a comprehensive benchmark covering both open-source and commercial generators with simulated real-world degradation. Across eight benchmark evaluations, REM achieves an average improvement of 7.5% over state-of-the-art methods, and notably maintains exceptional generalization on the severely degraded RealChain benchmark, establishing a solid foundation for synthetic image detection under real-world conditions. The code and the RealChain benchmark will be made publicly available upon acceptance of the paper.
Problem

Research questions and friction points this paper is trying to address.

Detect AI-generated images under real-world degradation conditions
Overcome detector overfitting to obsolete generator-specific artifacts
Maintain detection robustness despite multi-platform sharing and processing
Innovation

Methods, ideas, or system contributions that make the work stand out.

Modeling real image distribution instead of generator artifacts
Using feature-level perturbations and envelope estimator for boundary learning
Introducing RealChain benchmark for real-world degradation evaluation
Ruiqi Liu
Ruiqi Liu
Texas Tech University
nonparametric methodsmachine learningeconometrics
Y
Yi Han
Southwest University
Zhengbo Zhang
Zhengbo Zhang
Singapore University of Technology and Design
Generative ModelsReinforcement Learning
L
Liwei Yao
Shanghai Second Polytechnic University
Z
Zhiyuan Yan
Peking University
J
Jialiang Shen
The University of Sydney
Z
ZhiJin Chen
Institute of Automation, Chinese Academy of Sciences
B
Boyi Sun
Institute of Automation, Chinese Academy of Sciences
L
Lubin Weng
Institute of Automation, Chinese Academy of Sciences
J
Jing Dong
Institute of Automation, Chinese Academy of Sciences
Y
Yan Wang
Tsinghua University
S
Shu Wu
Institute of Automation, Chinese Academy of Sciences