🤖 AI Summary
This work addresses the challenges of slow convergence and high communication and memory overhead in federated learning under heterogeneous and resource-constrained settings. We propose a federated optimization method that integrates stochastic low-rank subspace projection with momentum mechanisms. By applying low-rank compression to client-side gradient updates, our approach significantly reduces communication volume and memory footprint while preserving model accuracy. Leveraging non-convex optimization theory, we provide rigorous convergence guarantees for the proposed algorithm. Experimental results on heterogeneous MNIST tasks demonstrate that our method achieves accuracy comparable to or better than FedAvg and existing sparsity- or low-rank-based baselines, while substantially lowering both communication and memory costs.
📝 Abstract
Federated learning enables a population of clients to collaboratively train machine learning models without exchanging their raw data, but standard algorithms such as FedAvg suffer from slow convergence and high communication and memory costs in heterogeneous, resource-constrained environments. We introduce FedSLoP, a federated optimization algorithm that combines stochastic low-rank subspace projections of gradients, thereby reducing the dimension of communicated and stored updates while preserving optimization progress. On the theoretical side, we develop a detailed nonconvex convergence analysis under standard smoothness and bounded-variance assumptions, showing that FedSLoP is guaranteed to converge to a first-order stationary point at a rate of $O(1/\sqrt{NT})$. On the empirical side, we conduct extensive experiments on federated MNIST classification with heterogeneous data partitions, showing that FedSLoP substantially reduces communication volume and client-side memory while achieving competitive or better accuracy compared with FedAvg and representative sparse or low-rank baselines. Together, our results demonstrate that random subspace momentum methods such as FedSLoP provide a principled and effective approach to communication- and memory-efficient federated learning. Codes are available at: https://github.com/pkumelon/FedSLoP.git.