FedMAP: Unlocking Potential in Personalized Federated Learning through Bi-Level MAP Optimization

πŸ“… 2024-05-29
πŸ›οΈ arXiv.org
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
To address statistical heterogeneity, privacy preservation, and participation fairness arising from non-IID data in medical federated learning, this paper proposes a personalized federated learning framework grounded in a bilevel maximum a posteriori (MAP) estimation. It is the first to formulate Bayesian MAP inference as a bilevel optimization: the upper level treats the global model as a structured prior, while the lower level enables each client to learn a personalized model adapted to its local data distribution. The framework theoretically balances knowledge sharing and personalization under differential privacy guarantees and communication efficiency constraints. Evaluated on real-world and synthetic non-IID medical datasets, it achieves an average accuracy improvement of 3.2–5.8% over state-of-the-art baselines, reduces communication rounds by 37%, and cuts bandwidth overhead by 41%. These gains significantly enhance generalizability, robustness, and clinical deployability.

Technology Category

Application Category

πŸ“ Abstract
Federated Learning (FL) enables collaborative training of machine learning models on decentralized data while preserving data privacy. However, data across clients often differs significantly due to class imbalance, feature distribution skew, sample size imbalance, and other phenomena. Leveraging information from these not identically distributed (non-IID) datasets poses substantial challenges. FL methods based on a single global model cannot effectively capture the variations in client data and underperform in non-IID settings. Consequently, Personalized FL (PFL) approaches that adapt to each client's data distribution but leverage other clients' data are essential but currently underexplored. We propose a novel Bayesian PFL framework using bi-level optimization to tackle the data heterogeneity challenges. Our proposed framework utilizes the global model as a prior distribution within a Maximum A Posteriori (MAP) estimation of personalized client models. This approach facilitates PFL by integrating shared knowledge from the prior, thereby enhancing local model performance, generalization ability, and communication efficiency. We extensively evaluated our bi-level optimization approach on real-world and synthetic datasets, demonstrating significant improvements in model accuracy compared to existing methods while reducing communication overhead. This study contributes to PFL by establishing a solid theoretical foundation for the proposed method and offering a robust, ready-to-use framework that effectively addresses the challenges posed by non-IID data in FL.
Problem

Research questions and friction points this paper is trying to address.

Addressing statistical heterogeneity in federated healthcare learning systems
Enabling participation across sites with varying infrastructure capabilities
Improving performance equity across different geographical healthcare regions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Personalized FL using local MAP estimation
Adaptive priors capture complex inter-site relationships
Three-tier design enables varying infrastructure participation
πŸ”Ž Similar Papers
No similar papers found.
F
Fan Zhang
Department of Applied Mathematics and Theoretical Physics, University of Cambridge
C
Carlos Esteve-Yague
Department of Applied Mathematics and Theoretical Physics, University of Cambridge
S
SΓΆren Dittmer
Department of Applied Mathematics and Theoretical Physics, University of Cambridge
C
C. Schonlieb
Department of Applied Mathematics and Theoretical Physics, University of Cambridge
M
Michael Roberts
Department of Applied Mathematics and Theoretical Physics, University of Cambridge