Towards Generalizable Deepfake Detection via Real Distribution Bias Correction

📅 2026-03-14

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

This work addresses the limited generalization of existing deepfake detection methods, which try to anticipate future forgeries from known source-domain samples and therefore struggle with unseen manipulation types. Instead of simulating new forgery types, the authors propose the Real Distribution Bias Correction (RDBC) framework, which exploits two invariant properties of authentic data: the fixed population distribution of the real class as a whole and the inherent Gaussianity of individual real images. RDBC combines a Real Population Distribution Estimation module, which uses the i.i.d. property of genuine samples to estimate distribution parameters from limited source-domain data, with a Distribution-Sampled Feature Whitening module, which applies a sampling-based whitening operation to amplify the Gaussianity gap between real and fake instances. The authors report state-of-the-art performance in both in-domain and cross-domain settings.

📝 Abstract

To generalize deepfake detectors to future unseen forgeries, most existing methods attempt to simulate the dynamically evolving forgery types using available source domain data. However, predicting an unbounded set of future manipulations from limited prior examples is infeasible. To overcome this limitation, we propose to exploit the invariance of real data from two complementary perspectives: the fixed population distribution of the entire real class and the inherent Gaussianity of individual real images. Building on these properties, we introduce the Real Distribution Bias Correction (RDBC) framework, which consists of two key components: the Real Population Distribution Estimation module and the Distribution-Sampled Feature Whitening module. The former utilizes the independent and identically distributed (i.i.d.) property of real samples to derive the normal distribution form of their statistics, from which the distribution parameters can be estimated using limited source domain data. Based on the learned population distribution, the latter utilizes the inherent Gaussianity of real data as a discriminative prior and performs a sampling-based whitening operation to amplify the Gaussianity gap between real and fake samples. Through synergistic coupling of the two modules, our model captures the real-world properties of real samples, thereby enhancing its generalizability to unseen target domains. Extensive experiments demonstrate that RDBC achieves state-of-the-art performance in both in-domain and cross-domain deepfake detection.
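
The general idea of whitening features with real-class statistics can be illustrated with a small NumPy sketch. This is not the authors' implementation: the abstract does not specify the estimator, the sampling step, or the Gaussianity measure, so the function names (`estimate_real_statistics`, `whiten`, `gaussianity_gap`), the plain sample mean and covariance, the ZCA-style whitening, and the second-moment score below are all assumptions chosen for illustration. The data is synthetic, and the sampling-based part of the paper's whitening is omitted.

```python
import numpy as np


def estimate_real_statistics(real_feats):
    """Estimate mean and covariance from real features of shape (n, d).

    Assumption: plain sample statistics stand in for the paper's
    population-distribution estimation module.
    """
    mu = real_feats.mean(axis=0)
    cov = np.cov(real_feats, rowvar=False)
    return mu, cov


def whiten(feats, mu, cov, eps=1e-5):
    """ZCA-style whitening of features using the real-class statistics."""
    evals, evecs = np.linalg.eigh(cov + eps * np.eye(cov.shape[0]))
    # Symmetric inverse square root of the (regularized) covariance.
    w = (evecs / np.sqrt(evals)) @ evecs.T
    return (feats - mu) @ w


def gaussianity_gap(z):
    """Per-sample deviation of the mean squared entry from 1.

    If z were drawn from a standard normal, the mean of its squared
    entries would be close to 1. This score is a hypothetical proxy,
    not the measure used in the paper.
    """
    return np.abs((z ** 2).mean(axis=1) - 1.0)


# Synthetic stand-ins for real and fake feature vectors.
rng = np.random.default_rng(0)
d = 32
mix = rng.normal(size=(d, d)) / np.sqrt(d)
real = rng.normal(size=(4000, d)) @ mix + 0.5
fake = 1.5 * (rng.normal(size=(4000, d)) @ mix) + 0.8

mu, cov = estimate_real_statistics(real)
real_gap = gaussianity_gap(whiten(real, mu, cov)).mean()
fake_gap = gaussianity_gap(whiten(fake, mu, cov)).mean()
print(f"mean gap, real: {real_gap:.3f}  fake: {fake_gap:.3f}")
```

In this toy setup the statistics are fitted on real features only, so real samples land near a standard normal after whitening while the shifted and rescaled fake samples do not. That is the kind of gap the abstract describes amplifying.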

Problem

Research questions and friction points this paper is trying to address.

deepfake detection

generalization

real distribution

unseen forgeries

domain generalization

Innovation

Methods, ideas, or system contributions that make the work stand out.

Real Distribution Bias Correction

Gaussianity

Generalizable Deepfake Detection