Fusion-SSAT: Unleashing the Potential of Self-supervised Auxiliary Task by Feature Fusion for Generalized Deepfake Detection

📅 2026-01-02
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the limited generalization capability of deepfake detection models when confronted with unseen forgery types and cross-dataset scenarios. To mitigate this issue, the authors propose a feature fusion approach that integrates self-supervised learning with multi-task learning. Specifically, a self-supervised task is incorporated as an auxiliary objective alongside the primary detection task during joint training, and an effective feature fusion strategy is designed to combine the representations from both tasks. This integration substantially enhances the model’s generalization performance. Extensive experiments demonstrate that the proposed method consistently outperforms state-of-the-art approaches across multiple benchmark datasets—including DF40, FaceForensics++, and Celeb-DF—with particularly notable gains in cross-dataset evaluation settings.

Technology Category

Application Category

📝 Abstract
In this work, we attempted to unleash the potential of self-supervised learning as an auxiliary task that can optimise the primary task of generalised deepfake detection. To explore this, we examined different combinations of the training schemes for these tasks that can be most effective. Our findings reveal that fusing the feature representation from self-supervised auxiliary tasks is a powerful feature representation for the problem at hand. Such a representation can leverage the ultimate potential and bring in a unique representation of both the self-supervised and primary tasks, achieving better performance for the primary task. We experimented on a large set of datasets, which includes DF40, FaceForensics++, Celeb-DF, DFD, FaceShifter, UADFV, and our results showed better generalizability on cross-dataset evaluation when compared with current state-of-the-art detectors.
Problem

Research questions and friction points this paper is trying to address.

deepfake detection
generalization
self-supervised learning
feature fusion
cross-dataset evaluation
Innovation

Methods, ideas, or system contributions that make the work stand out.

self-supervised learning
feature fusion
generalized deepfake detection
auxiliary task
cross-dataset generalization
S
Shukesh Reddy
Machine Intelligence Group, Department of CS&IS, Birla Institute of Technology and Sciences, Pilani, India
S
Srijan Das
University of North Carolina at Charlotte, United States of America
Abhijit Das
Abhijit Das
BITS Pilani Hyderabad, Dept of CS&IS
Computer VisionPattern RecognitionMachine Learning