Human-AI Ensembles Improve Deepfake Detection in Low-to-Medium Quality Videos

📅 2026-03-15
📈 Citations: 0
✨ Influential: 0
🤖 AI Summary
This study addresses the significant performance degradation of existing deepfake detection methods on real-world videos of low to moderate visual quality. To this end, the authors systematically compare the detection capabilities of 200 human participants against 95 state-of-the-art AI detectors on both the standard DF40 benchmark and a newly curated dataset, CharadesDF, comprising everyday smartphone-captured videos. The work reveals, for the first time, a complementary relationship between human and AI error patterns in deepfake detection and proposes an integrated human-AI strategy that substantially reduces high-confidence misclassifications. Experimental results demonstrate that human accuracy on CharadesDF reaches 0.784, markedly surpassing the AI performance of 0.537, thereby underscoring the necessity and efficacy of human-AI collaboration for robust deepfake detection in realistic scenarios.

๐Ÿ“ Abstract
Deepfake detection is widely framed as a machine learning problem, yet how humans and AI detectors compare under realistic conditions remains poorly understood. We evaluate 200 human participants and 95 state-of-the-art AI detectors across two datasets: DF40, a standard benchmark, and CharadesDF, a novel dataset of videos of everyday activities. CharadesDF was recorded using mobile phones, leading to low- to moderate-quality videos compared to the more professionally captured DF40. Humans outperform AI detectors on both datasets, with the gap widening on CharadesDF, where AI accuracy collapses to near chance (0.537) while humans maintain robust performance (0.784). Human and AI errors are complementary: humans miss high-quality deepfakes while AI detectors flag authentic videos as fake, and hybrid human-AI ensembles reduce high-confidence errors. These findings suggest that effective real-world deepfake detection, especially in non-professionally produced videos, requires human-AI collaboration rather than AI algorithms alone.
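To make the idea of a hybrid human-AI ensemble concrete, here is a minimal sketch in Python. It is not the authors' method: the abstract does not say how the ensembles are built, so the equal-weight averaging, the 0.5 decision threshold, the 0.3 confidence margin, and the function names `ensemble_score` and `is_high_confidence_error` are all assumptions chosen for illustration. The scores in the example are invented, not taken from the paper.

```python
from statistics import mean


def ensemble_score(human_scores, ai_scores, human_weight=0.5):
    """Blend the mean human and mean AI 'probability of fake' scores.

    Assumption: a simple weighted average; the paper's actual
    aggregation rule is not given in the abstract.
    """
    if not 0.0 <= human_weight <= 1.0:
        raise ValueError("human_weight must be in [0, 1]")
    return human_weight * mean(human_scores) + (1.0 - human_weight) * mean(ai_scores)


def is_high_confidence_error(score, is_fake, margin=0.3):
    """Return True when the score is wrong AND far from the 0.5 threshold.

    Assumption: 'high confidence' means at least `margin` away from 0.5.
    """
    predicted_fake = score >= 0.5
    confident = abs(score - 0.5) >= margin
    return confident and predicted_fake != is_fake


# Hypothetical authentic video: the AI detectors raise a false alarm,
# while the human raters mostly judge it to be real.
human = [0.2, 0.1, 0.3]
ai = [0.9, 0.85]

ai_only = mean(ai)
combined = ensemble_score(human, ai)

print(is_high_confidence_error(ai_only, is_fake=False))
print(is_high_confidence_error(combined, is_fake=False))
```

In this made-up case the AI-only score is a confident false alarm, while the blended score lands close to the threshold. It is still on the wrong side, but it no longer counts as a high-confidence error, which is the kind of effect the abstract attributes to hybrid ensembles.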
Problem

Research questions and friction points this paper is trying to address.

deepfake detection
low-quality videos
human-AI collaboration
real-world conditions
AI performance gap
Innovation

Methods, ideas, or system contributions that make the work stand out.

human-AI collaboration
deepfake detection
low-quality video
complementary errors
ensemble detection
Marco Postiglione
Postdoc, Northwestern University
Deep Learning, Natural Language Processing, Graph Analytics
Isabel Gortner
Northwestern University
V. S. Subrahmanian
Northwestern University