Are Foundation Models All You Need for Zero-shot Face Presentation Attack Detection?

📅 2025-07-22
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the weak generalization capability and heavy reliance on labeled data in zero-shot face presentation attack detection (PAD), this paper proposes a foundation-model-based zero-shot PAD framework. The method leverages pre-trained foundation models to extract robust facial features and enables label-free recognition of unseen attack types—such as novel 2D or 3D attacks—via zero-shot similarity matching, without fine-tuning. Its core innovation lies in constructing a prompt-guided feature space with cross-domain semantic alignment, substantially enhancing generalization across unknown attacks and heterogeneous benchmark databases. Under the “leave-one-attack-out” protocol on SiW-Mv2, the proposed approach achieves significant performance gains over existing state-of-the-art methods. This work establishes a new paradigm for biometric security systems operating under low-resource constraints while demanding high generalization capability.

Technology Category

Application Category

📝 Abstract
Although face recognition systems have undergone an impressive evolution in the last decade, these technologies are vulnerable to attack presentations (AP). These attacks are mostly easy to create and, by executing them against the system's capture device, the malicious actor can impersonate an authorised subject and thus gain access to the latter's information (e.g., financial transactions). To protect facial recognition schemes against presentation attacks, state-of-the-art deep learning presentation attack detection (PAD) approaches require a large amount of data to produce reliable detection performances and even then, they decrease their performance for unknown presentation attack instruments (PAI) or database (information not seen during training), i.e. they lack generalisability. To mitigate the above problems, this paper focuses on zero-shot PAD. To do so, we first assess the effectiveness and generalisability of foundation models in established and challenging experimental scenarios and then propose a simple but effective framework for zero-shot PAD. Experimental results show that these models are able to achieve performance in difficult scenarios with minimal effort of the more advanced PAD mechanisms, whose weights were optimised mainly with training sets that included APs and bona fide presentations. The top-performing foundation model outperforms by a margin the best from the state of the art observed with the leaving-one-out protocol on the SiW-Mv2 database, which contains challenging unknown 2D and 3D attacks
Problem

Research questions and friction points this paper is trying to address.

Detecting zero-shot face presentation attacks effectively
Assessing generalizability of foundation models in PAD
Improving PAD performance for unknown attack types
Innovation

Methods, ideas, or system contributions that make the work stand out.

Utilizes foundation models for zero-shot PAD
Assesses generalizability in challenging scenarios
Outperforms state-of-the-art with minimal effort
🔎 Similar Papers
No similar papers found.
L
Lazaro Janier Gonzalez-Sole
da/sec - Biometrics and Security Research Group, Darmstadt, Germany
J
Juan E. Tapia
da/sec - Biometrics and Security Research Group, Darmstadt, Germany
Christoph Busch
Christoph Busch
Professor for Biometrics, Norwegian University of Science and Technology (NTNU)
Biometrics