Open-Set Source Tracing of Audio Deepfake Systems

📅 2025-07-08

📈 Citations: 0

✨ Influential: 0

career value

271K/year

🤖 AI Summary

Open-set provenance attribution for audio deepfake systems—robustly identifying a large number of previously unseen forgery systems—remains a critical challenge. Method: We propose a softmax energy (SME)-based out-of-distribution detection framework tailored for provenance attribution. Our approach innovatively integrates SME into the attribution task via an SME-guided training mechanism and synergistically combines multiple data augmentation strategies: temperature scaling, copy-synthesis, codec-induced distortion, and reverberation enhancement. Contribution/Results: Evaluated under the Interspeech 2025 benchmark protocol, our method significantly improves open-set generalization, achieving an FPR95 of 8.3%—a 31% relative reduction over the prior state of the art. This establishes a novel paradigm for tracing previously unknown audio deepfake systems in real-world scenarios.

Technology Category

Application Category

📝 Abstract

Existing research on source tracing of audio deepfake systems has focused primarily on the closed-set scenario, while studies that evaluate open-set performance are limited to a small number of unseen systems. Due to the large number of emerging audio deepfake systems, robust open-set source tracing is critical. We leverage the protocol of the Interspeech 2025 special session on source tracing to evaluate methods for improving open-set source tracing performance. We introduce a novel adaptation to the energy score for out-of-distribution (OOD) detection, softmax energy (SME). We find that replacing the typical temperature-scaled energy score with SME provides a relative average improvement of 31% in the standard FPR95 (false positive rate at true positive rate of 95%) measure. We further explore SME-guided training as well as copy synthesis, codec, and reverberation augmentations, yielding an FPR95 of 8.3%.

Problem

Research questions and friction points this paper is trying to address.

Improving open-set source tracing for audio deepfakes

Enhancing OOD detection using softmax energy adaptation

Reducing FPR95 with SME-guided training and augmentations

Innovation

Methods, ideas, or system contributions that make the work stand out.

Introduces softmax energy (SME) for OOD detection

Uses SME-guided training for performance improvement

Applies copy synthesis, codec, and reverberation augmentations

🔎 Similar Papers

Audio Anti-Spoofing Detection: A Survey