Stochastic Approximation Approach to Federated Machine Learning

πŸ“… 2024-02-20
πŸ›οΈ arXiv.org
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF

career value

216K/year
πŸ€– AI Summary
Federated learning (FL) suffers from weak and only expectation-based convergence under non-independent and identically distributed (Non-IID) client data. Method: This paper introduces stochastic approximation (SA) theory into FL and proposes a client-adaptive decaying step-size mechanism, where local update step sizes decrease heterogeneously across clients and iterations, ensuring almost-sure convergence of the global model. Contribution/Results: It is the first work to systematically embed the SA framework into FL, rigorously proving that the aggregation weights asymptotically track an autonomous ordinary differential equation (ODE), thereby establishing a solid theoretical foundation for convergence. The method dynamically adjusts each client’s contribution to the global model, enhancing robustness to data rarity. Experiments under Non-IID settings demonstrate superior convergence stability and higher final accuracy compared to FedAvg and FedProx, validating both the theoretical tracking property and strong generalization capability.

Technology Category

Application Category

πŸ“ Abstract
This paper examines Federated learning (FL) in a Stochastic Approximation (SA) framework. FL is a collaborative way to train neural network models across various participants or clients without centralizing their data. Each client will train a model on their respective data and send the weights across to a the server periodically for aggregation. The server aggregates these weights which are then used by the clients to re-initialize their neural network and continue the training. SA is an iterative algorithm that uses approximate sample gradients and tapering step size to locate a minimizer of a cost function. In this paper the clients use a stochastic approximation iterate to update the weights of its neural network. It is shown that the aggregated weights track an autonomous ODE. Numerical simulations are performed and the results are compared with standard algorithms like FedAvg and FedProx. It is observed that the proposed algorithm is robust and gives more reliable estimates of the weights, in particular when the clients data are not identically distributed.
Problem

Research questions and friction points this paper is trying to address.

Develops client-specific step sizes for federated learning convergence
Analyzes global model dynamics using stochastic approximation framework
Regulates client influence through step size ratios for data heterogeneity
Innovation

Methods, ideas, or system contributions that make the work stand out.

Client-specific tapering step sizes for model training
Global model tracks ODE with weighted gradient sum
Clients with larger step size ratios exert greater influence
πŸ”Ž Similar Papers
No similar papers found.