Exploring Modality Disruption in Multimodal Fake News Detection

📅 2025-04-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Multimodal fake news detection on social media suffers from “modality disruption”: exaggerated or stylistically embellished content in textual and visual modalities misleads discriminative learning. This work formally introduces the concept of “modality interference,” empirically demonstrating its detrimental impact on detection performance. To address this, we propose FND-MoE—a novel framework integrating Mixture-of-Experts (MoE), cross-modal feature alignment, and a dual-path adaptive feature selection mechanism to dynamically identify and suppress interfering modalities. Additionally, we introduce a modality importance reweighting strategy to enhance robustness. Evaluated on FakeSV and FVC-2018 benchmarks, FND-MoE achieves absolute accuracy improvements of 3.45% and 3.71%, respectively, surpassing state-of-the-art methods. Our approach establishes a new paradigm for robust multimodal fake news detection by explicitly modeling and mitigating modality-specific interference.

Technology Category

Application Category

📝 Abstract
The rapid growth of social media has led to the widespread dissemination of fake news across multiple content forms, including text, images, audio, and video. Compared to unimodal fake news detection, multimodal fake news detection benefits from the increased availability of information across multiple modalities. However, in the context of social media, certain modalities in multimodal fake news detection tasks may contain disruptive or over-expressive information. These elements often include exaggerated or embellished content. We define this phenomenon as modality disruption and explore its impact on detection models through experiments. To address the issue of modality disruption in a targeted manner, we propose a multimodal fake news detection framework, FND-MoE. Additionally, we design a two-pass feature selection mechanism to further mitigate the impact of modality disruption. Extensive experiments on the FakeSV and FVC-2018 datasets demonstrate that FND-MoE significantly outperforms state-of-the-art methods, with accuracy improvements of 3.45% and 3.71% on the respective datasets compared to baseline models.
Problem

Research questions and friction points this paper is trying to address.

Addressing modality disruption in multimodal fake news detection
Proposing FND-MoE framework to mitigate disruptive modality effects
Improving detection accuracy via two-pass feature selection mechanism
Innovation

Methods, ideas, or system contributions that make the work stand out.

Proposes FND-MoE framework for fake news detection
Introduces two-pass feature selection mechanism
Addresses modality disruption in multimodal content
🔎 Similar Papers
No similar papers found.
Moyang Liu
Moyang Liu
Beihang University
MultimediaMultimodal Learning
Kaiying Yan
Kaiying Yan
Sun Yat-sen University
Y
Yukun Liu
School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China
Ruibo Fu
Ruibo Fu
Associate Professor,CASIA
AIGCLMMIntelligent speech interactionDeepfake detection
Zhengqi Wen
Zhengqi Wen
Tshinghua University
LLM
X
Xuefei Liu
Institute of Automation, Chinese Academy of Sciences, Beijing, China
C
Chenxing Li
Tencent AI Lab