The paradigm shift: A comprehensive survey on large vision language models for multimodal fake news detection

📅 2026-01-16
🏛️ Computer Science Review
📈 Citations: 1
Influential: 1
📄 PDF
🤖 AI Summary
Multimodal fake news detection has long been constrained by shallow feature fusion, struggling to capture high-level semantics and complex cross-modal interactions. This work systematically reviews the paradigm shift driven by Large Vision-Language Models (LVLMs), tracing the evolution from traditional approaches to end-to-end unified reasoning frameworks. It presents the first structured taxonomy encompassing model architectures, datasets, and evaluation benchmarks. By mapping the trajectory of technical advancements, the study offers an in-depth analysis of critical challenges—including interpretability, temporal reasoning, and domain generalization—and constructs a comprehensive technical landscape for multimodal fake news detection in the LVLM era, providing both theoretical insights and practical guidance for future research.

Technology Category

Application Category

Problem

Research questions and friction points this paper is trying to address.

multimodal fake news detection
large vision-language models
paradigm shift
systematic survey
cross-modal reasoning
Innovation

Methods, ideas, or system contributions that make the work stand out.

Large Vision-Language Models
Multimodal Fake News Detection
Paradigm Shift
Foundation Models
Cross-Modal Reasoning
🔎 Similar Papers
No similar papers found.
W
Wei Ai
College of Computer and Mathematics, Central South University of Forestry and Technology, Hunan, Changsha, China
Y
Yilong Tan
College of Computer and Mathematics, Central South University of Forestry and Technology, Hunan, Changsha, China
Y
Yuntao Shou
College of Computer and Mathematics, Central South University of Forestry and Technology, Hunan, Changsha, China
Tao Meng
Tao Meng
Central South University of Forestry and Technology
Graph Neural NetworkMultimodal Emotion RecognitionText ClassificationEntity Alignment
H
Haowen Chen
College of Computer Science and Electronic Engineering, Hunan University, Hunan, Changsha, China
Z
Zhixiong He
College of Economics and Management, Central South University of Forestry and Technology, Hunan, Changsha, China
Keqin Li
Keqin Li
AMA University
RoboticMachine learningArtificial intelligenceComputer vision