Federated Learning for Medical Image Classification: A Comprehensive Benchmark

📅 2025-04-07
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the instability of algorithm performance and the lack of standardized evaluation benchmarks in medical image federated learning (FL), this paper establishes the first unified FL benchmark encompassing diverse, real-world multimodal medical imaging datasets. It systematically evaluates mainstream FL algorithms—including FedAvg, FedProx, and SCAFFOLD—under challenging non-IID, few-shot, and cross-institution heterogeneous settings, assessing their accuracy, communication efficiency, and computational overhead. Methodologically, we propose a novel data augmentation framework integrating denoising diffusion probabilistic models (DDPMs) with label smoothing to enhance model robustness and generalization. Experimental results demonstrate that our approach achieves an average 4.2% improvement in classification accuracy across six medical imaging datasets and reduces convergence rounds by 31%. The complete codebase and benchmark are publicly released to foster reproducible research and fair comparison in medical FL.

Technology Category

Application Category

📝 Abstract
The federated learning paradigm is wellsuited for the field of medical image analysis, as it can effectively cope with machine learning on isolated multicenter data while protecting the privacy of participating parties. However, current research on optimization algorithms in federated learning often focuses on limited datasets and scenarios, primarily centered around natural images, with insufficient comparative experiments in medical contexts. In this work, we conduct a comprehensive evaluation of several state-of-the-art federated learning algorithms in the context of medical imaging. We conduct a fair comparison of classification models trained using various federated learning algorithms across multiple medical imaging datasets. Additionally, we evaluate system performance metrics, such as communication cost and computational efficiency, while considering different federated learning architectures. Our findings show that medical imaging datasets pose substantial challenges for current federated learning optimization algorithms. No single algorithm consistently delivers optimal performance across all medical federated learning scenarios, and many optimization algorithms may underperform when applied to these datasets. Our experiments provide a benchmark and guidance for future research and application of federated learning in medical imaging contexts. Furthermore, we propose an efficient and robust method that combines generative techniques using denoising diffusion probabilistic models with label smoothing to augment datasets, widely enhancing the performance of federated learning on classification tasks across various medical imaging datasets. Our code will be released on GitHub, offering a reliable and comprehensive benchmark for future federated learning studies in medical imaging.
Problem

Research questions and friction points this paper is trying to address.

Evaluating federated learning algorithms for medical image classification
Addressing performance challenges in medical federated learning scenarios
Proposing a robust method to enhance federated learning performance
Innovation

Methods, ideas, or system contributions that make the work stand out.

Evaluates federated learning algorithms for medical imaging
Proposes generative techniques with diffusion models for data augmentation
Benchmarks performance across diverse medical imaging datasets
🔎 Similar Papers
No similar papers found.
Z
Zhekai Zhou
School of Electronic and Computer Engineering, Peking University
Guibo Luo
Guibo Luo
Peking University
medical imagingprivacy computing
M
Mingzhi Chen
School of Electronic and Computer Engineering, Peking University
Z
Zhenyu Weng
Shien-Ming Wu School of Intelligent Engineering, South China University of Technology
Yuesheng Zhu
Yuesheng Zhu
Peking University
Intelligent security