🤖 AI Summary
Lack of standardized evaluation protocols for backdoor attacks and defenses in federated learning (FL) hinders fair comparison and practical deployment. To address this, we introduce an efficient, modular benchmarking platform supporting computer vision and natural language processing tasks, multiple model architectures, and diverse experimental configurations. The platform employs multi-process acceleration and a standardized evaluation pipeline, offering plug-and-play interfaces for attack and defense algorithms to ensure reproducible, cross-task and cross-model experiments. Through large-scale empirical studies, we systematically uncover failure modes of 12 state-of-the-art methods under realistic constraints, identifying several previously unreported critical failure scenarios. This work establishes the first reproducible evaluation paradigm for FL backdoor attacks and defenses, providing empirically grounded insights and practical guidelines for designing robust security mechanisms.
📝 Abstract
Federated Learning (FL) systems are vulnerable to backdoor attacks, where adversaries train their local models on poisoned data and submit the resulting poisoned model updates to compromise the global model. Despite numerous proposed attacks and defenses, divergent experimental settings, implementation errors, and unrealistic assumptions hinder fair comparisons and valid conclusions about their effectiveness in real-world scenarios. To address this, we introduce BackFed, a comprehensive benchmark suite designed to standardize, streamline, and reliably evaluate backdoor attacks and defenses in FL, with a focus on practical constraints. Our benchmark offers key advantages through its multi-processing implementation, which significantly accelerates experimentation, and its modular design, which enables seamless integration of new methods via well-defined APIs. With a standardized evaluation pipeline, we envision BackFed as a plug-and-play environment for researchers to comprehensively and reliably evaluate new attacks and defenses. Using BackFed, we conduct large-scale studies of representative backdoor attacks and defenses across both Computer Vision and Natural Language Processing tasks with diverse model architectures and experimental settings. Our experiments critically assess the performance of proposed attacks and defenses, revealing previously unknown limitations and failure modes under practical conditions. These empirical insights provide valuable guidance for developing new methods and for enhancing the security of FL systems. Our framework is openly available at https://github.com/thinh-dao/BackFed.
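To make the "plug-and-play interfaces for attack and defense algorithms" concrete, here is a minimal sketch of what such a modular API could look like. All class and method names (`ClientAttack`, `ServerDefense`, `poison_update`, `aggregate`) are illustrative assumptions, not BackFed's actual interfaces; the toy attack (update scaling) and defense (L2 norm clipping before averaging) stand in for the 12 benchmarked methods.

```python
# Hypothetical sketch of a plug-and-play attack/defense API in the spirit of
# BackFed's modular design. Names are illustrative, NOT the real BackFed API.
from abc import ABC, abstractmethod
from typing import List

class ClientAttack(ABC):
    """Malicious client behavior: transform the local update before submission."""
    @abstractmethod
    def poison_update(self, update: List[float]) -> List[float]: ...

class ServerDefense(ABC):
    """Server-side defense: filter/transform client updates, then aggregate."""
    @abstractmethod
    def aggregate(self, updates: List[List[float]]) -> List[float]: ...

class ScaleAttack(ClientAttack):
    """Model-replacement-style attack: boost the malicious update by a factor."""
    def __init__(self, factor: float = 10.0):
        self.factor = factor

    def poison_update(self, update: List[float]) -> List[float]:
        return [w * self.factor for w in update]

class NormClippingDefense(ServerDefense):
    """Clip each update to a fixed L2 norm, then average (a common baseline)."""
    def __init__(self, clip_norm: float = 1.0):
        self.clip_norm = clip_norm

    def aggregate(self, updates: List[List[float]]) -> List[float]:
        clipped = []
        for u in updates:
            norm = sum(w * w for w in u) ** 0.5
            scale = min(1.0, self.clip_norm / norm) if norm > 0 else 1.0
            clipped.append([w * scale for w in u])
        n = len(clipped)
        # Coordinate-wise mean of the clipped updates (FedAvg-style).
        return [sum(col) / n for col in zip(*clipped)]

# One simulated round: two benign clients and one attacker.
benign = [[0.1, 0.2], [0.2, 0.1]]
malicious = ScaleAttack(factor=50.0).poison_update([0.1, 0.1])
global_update = NormClippingDefense(clip_norm=0.5).aggregate(benign + [malicious])
```

Because both sides implement a fixed interface, new attacks and defenses can be swapped into the same evaluation loop without touching the training pipeline, which is the property the standardized pipeline relies on for fair, reproducible comparisons.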