AgentFoX: LLM Agent-Guided Fusion with eXplainability for AI-Generated Image Detection

📅 2026-03-24
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work proposes a multi-stage dynamic analysis framework powered by a large language model (LLM) agent to address the limitations of existing AI-generated image detection methods, which often rely on a single type of forgery cue and consequently suffer from constrained performance and conflicting outcomes. By synergistically integrating semantic and signal-level evidence, the framework enables context-aware incorporation of expert knowledge and resolves inconsistencies through an expert profiling knowledge base, contextual clustering, and a rapid ensemble mechanism. The resulting paradigm is both scalable and interpretable, producing fine-grained, human-readable forensic reports that significantly enhance detection reliability and practical deployability in real-world scenarios.

Technology Category

Application Category

📝 Abstract
The increasing realism of AI-Generated Images (AIGI) has created an urgent need for forensic tools capable of reliably distinguishing synthetic content from authentic imagery. Existing detectors are typically tailored to specific forgery artifacts--such as frequency-domain patterns or semantic inconsistencies--leading to specialized performance and, at times, conflicting judgments. To address these limitations, we present \textbf{AgentFoX}, a Large Language Model-driven framework that redefines AIGI detection as a dynamic, multi-phase analytical process. Our approach employs a quick-integration fusion mechanism guided by a curated knowledge base comprising calibrated Expert Profiles and contextual Clustering Profiles. During inference, the agent begins with high-level semantic assessment, then transitions to fine-grained, context-aware synthesis of signal-level expert evidence, resolving contradictions through structured reasoning. Instead of returning a coarse binary output, AgentFoX produces a detailed, human-readable forensic report that substantiates its verdict, enhancing interpretability and trustworthiness for real-world deployment. Beyond providing a novel detection solution, this work introduces a scalable agentic paradigm that facilitates intelligent integration of future and evolving forensic tools.
Problem

Research questions and friction points this paper is trying to address.

AI-Generated Image Detection
Forgery Artifacts
Semantic Inconsistencies
Explainability
Forensic Tools
Innovation

Methods, ideas, or system contributions that make the work stand out.

LLM Agent
Explainable AI
AI-Generated Image Detection
Knowledge-Guided Fusion
Forensic Reasoning
🔎 Similar Papers
No similar papers found.
Y
Yangxin Yu
Guangdong Provincial Key Laboratory of Intelligent Information Processing, Shenzhen Key Laboratory of Media Security, and SZU-AFS Joint Innovation Center for AI Technology, Shenzhen University, Shenzhen 518060, China
Yue Zhou
Yue Zhou
Shenzhen University, PHD Student
Multimedia ForensicsAI Safety
Bin Li
Bin Li
Nanjing University of Science and Technology
Algorithmic Game TheoryMechanism DesignAuction Theory
Kaiqing Lin
Kaiqing Lin
Shenzhen University
Multimedia ForensicsMultimedia SecuritySteganalysis
Haodong Li
Haodong Li
UC San Diego. Prev: HKUST, ZJU, Tencent.
3DVGenerative ModelsAgents
Jiangqun Ni
Jiangqun Ni
Professor, Sun Yat-Sen University
multimedia security
B
Bo Cao
The Smart City Research Institute of China Electronics Technology Group Corporation, Shenzhen, China