Morphology-optimized Multi-Scale Fusion: Combining Local Artifacts and Mesoscopic Semantics for Deepfake Detection and Localization

📅 2025-09-17
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the challenges of insufficient coordination between local details and global semantics, and noise susceptibility in multi-scale prediction fusion for deepfake detection and localization, this paper proposes a multi-scale morphological optimization fusion framework. Methodologically, it introduces a dual-branch architecture comprising local feature extraction and mesoscopic semantic analysis, and innovatively incorporates morphological opening and closing operations to enforce spatial consistency across scales and suppress prediction noise. This design effectively mitigates error amplification inherent in conventional fusion strategies, thereby significantly improving localization accuracy and robustness. Extensive experiments on multiple benchmark datasets demonstrate that the proposed method consistently outperforms state-of-the-art approaches in both localization accuracy and model stability, validating the effectiveness and generalizability of morphologically guided multi-scale fusion.

Technology Category

Application Category

📝 Abstract
While the pursuit of higher accuracy in deepfake detection remains a central goal, there is an increasing demand for precise localization of manipulated regions. Despite the remarkable progress made in classification-based detection, accurately localizing forged areas remains a significant challenge. A common strategy is to incorporate forged region annotations during model training alongside manipulated images. However, such approaches often neglect the complementary nature of local detail and global semantic context, resulting in suboptimal localization performance. Moreover, an often-overlooked aspect is the fusion strategy between local and global predictions. Naively combining the outputs from both branches can amplify noise and errors, thereby undermining the effectiveness of the localization. To address these issues, we propose a novel approach that independently predicts manipulated regions using both local and global perspectives. We employ morphological operations to fuse the outputs, effectively suppressing noise while enhancing spatial coherence. Extensive experiments reveal the effectiveness of each module in improving the accuracy and robustness of forgery localization.
Problem

Research questions and friction points this paper is trying to address.

Accurately localizing forged regions in deepfake detection
Neglecting complementary local and global semantic contexts
Suboptimal fusion strategy amplifying noise and errors
Innovation

Methods, ideas, or system contributions that make the work stand out.

Morphology-optimized multi-scale fusion strategy
Independent local and global prediction branches
Morphological operations for noise suppression
🔎 Similar Papers
No similar papers found.
Chao Shuai
Chao Shuai
Zhejiang University
G
Gaojian Wang
The State Key Laboratory of Blockchain and Data Security, Zhejiang University
K
Kun Pan
The State Key Laboratory of Blockchain and Data Security, Zhejiang University
T
Tong Wu
The State Key Laboratory of Blockchain and Data Security, Zhejiang University
F
Fanli Jin
The State Key Laboratory of Blockchain and Data Security, Zhejiang University
H
Haohan Tan
The State Key Laboratory of Blockchain and Data Security, Zhejiang University
M
Mengxiang Li
The State Key Laboratory of Blockchain and Data Security, Zhejiang University
Zhenguang Liu
Zhenguang Liu
Zhejiang University
BlockchainSmart Contract SecurityMultimedia
F
Feng Lin
The State Key Laboratory of Blockchain and Data Security, Zhejiang University; Hangzhou High-Tech Zone (Binjiang) Institute of Blockchain and Data Security
Kui Ren
Kui Ren
Professor and Dean of Computer Science, Zhejiang University, ACM/IEEE Fellow
Data Security & PrivacyAI SecurityIoT & Vehicular Security