🤖 AI Summary
Facial recognition systems are vulnerable to backdoor attacks, where adversaries embed subtle triggers—e.g., stickers or masks—into a small subset of training images, causing the model to misclassify adversarial users as targeted identities during authentication. Existing defenses struggle to simultaneously achieve high poisoning-sample detection accuracy and preserve data utility. This paper proposes TrueBiometric, the first framework that synergistically integrates multiple vision-language models (VLMs) with a majority-voting mechanism to precisely localize triggers. It further combines gradient-based trigger identification with targeted calibration noise injection to enable lossless repair of poisoned samples. Evaluated across multiple benchmarks, TrueBiometric achieves 100% detection and correction rates for backdoor images while maintaining the original face recognition accuracy—outperforming state-of-the-art defenses by a significant margin.
📝 Abstract
Biometric systems, such as face recognition systems powered by deep neural networks (DNNs), rely on large and highly sensitive datasets. Backdoor attacks can subvert these systems by manipulating the training process. By inserting a small trigger, such as a sticker, make-up, or patterned mask, into a few training images, an adversary can later present the same trigger during authentication to be falsely recognized as another individual, thereby gaining unauthorized access. Existing defense mechanisms against backdoor attacks still face challenges in precisely identifying and mitigating poisoned images without compromising data utility, which undermines the overall reliability of the system. We propose a novel and generalizable approach, TrueBiometric: Trustworthy Biometrics, which accurately detects poisoned images using a majority voting mechanism leveraging multiple state-of-the-art large vision language models. Once identified, poisoned samples are corrected using targeted and calibrated corrective noise. Our extensive empirical results demonstrate that TrueBiometric detects and corrects poisoned images with 100% accuracy without compromising accuracy on clean images. Compared to existing state-of-the-art approaches, TrueBiometric offers a more practical, accurate, and effective solution for mitigating backdoor attacks in face recognition systems.