Personalized Federated Learning of Probabilistic Models: A PAC-Bayesian Approach

📅 2024-01-16
🏛️ arXiv.org
📈 Citations: 3
Influential: 0
📄 PDF
🤖 AI Summary
In federated learning, highly heterogeneous and scarce client data impede effective personalized modeling. Method: We propose PAC-PFL, a personalized federated learning algorithm grounded in the PAC-Bayesian framework that jointly learns a shared hyper-posterior distribution, enabling Bayesian personalized posterior inference per client. PAC-PFL is the first to integrate PAC-Bayesian generalization bounds with differential privacy to handle data-dependent priors, unifies global collaboration and local adaptation via hyper-posterior modeling, and combines variational Bayesian inference, federated hyperparameter learning, and Dirichlet-based non-IID data partitioning. Results: Experiments on photovoltaic power forecasting, FEMNIST, and Dirichlet-EMNIST demonstrate that PAC-PFL reduces average prediction error by 12.7% over baselines and improves expected calibration error (ECE) by over 40%, significantly enhancing generalization and predictive calibration—especially for clients with limited data.

Technology Category

Application Category

📝 Abstract
Federated learning aims to infer a shared model from private and decentralized data stored locally by multiple clients. Personalized federated learning (PFL) goes one step further by adapting the global model to each client, enhancing the model's fit for different clients. A significant level of personalization is required for highly heterogeneous clients, but can be challenging to achieve especially when they have small datasets. To address this problem, we propose a PFL algorithm named PAC-PFL for learning probabilistic models within a PAC-Bayesian framework that utilizes differential privacy to handle data-dependent priors. Our algorithm collaboratively learns a shared hyper-posterior and regards each client's posterior inference as the personalization step. By establishing and minimizing a generalization bound on the average true risk of clients, PAC-PFL effectively combats over-fitting. PACPFL achieves accurate and well-calibrated predictions, supported by experiments on a dataset of photovoltaic panel power generation, FEMNIST dataset (Caldas et al., 2019), and Dirichlet-partitioned EMNIST dataset (Cohen et al., 2017).
Problem

Research questions and friction points this paper is trying to address.

Enhances model fit for heterogeneous clients in federated learning
Addresses overfitting in data-poor personalized federated learning scenarios
Provides generalization bounds for new clients in probabilistic models
Innovation

Methods, ideas, or system contributions that make the work stand out.

PAC-PFL infers shared hyper-posterior for personalization
Minimizes PAC-Bayesian bound to prevent overfitting
Provides generalization bounds for new clients
🔎 Similar Papers
No similar papers found.
M
M. G. Boroujeni
Institute of Mechanical Engineering, EPFL, Switzerland
A
Andreas Krause
Department of Computer Science, ETH Zürich, Switzerland
G
Giancarlo Ferrari-Trecate
Institute of Mechanical Engineering, EPFL, Switzerland