FedCod: An Efficient Communication Protocol for Cross-Silo Federated Learning with Coding

📅 2024-12-31
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address communication bottlenecks in geographically distributed federated learning (FL) caused by network heterogeneity and spatial dispersion, this work proposes an FL-algorithm-agnostic application-layer communication protocol. The protocol innovatively integrates network coding—specifically random linear coding—at the protocol layer, coupled with a client-to-client peer-to-peer (P2P) topology, real-time network state awareness, and dynamic redundancy control, enabling bandwidth-adaptive transmission. Crucially, it preserves local training logic and model accuracy without modification. Experiments demonstrate a 62% reduction in average communication time, zero degradation in end-to-end training performance, and significant reduction in total inter-client communication traffic. To the best of our knowledge, this is the first work to systematically unify coding, P2P networking, and adaptive control at the FL communication protocol level, establishing a lightweight, general-purpose, and practically effective communication optimization paradigm for distributed collaborative learning under high-latency and low-bandwidth conditions.

Technology Category

Application Category

📝 Abstract
Federated Learning (FL) is an innovative distributed machine learning paradigm that enables multiple parties to collaboratively train a model without sharing their raw data, thereby preserving data privacy. Communication efficiency concerns arise in cross-silo FL, particularly due to the network heterogeneity and fluctuations associated with geo-distributed silos. Most existing solutions to these problems focus on algorithmic improvements that alter the FL algorithm but sacrificing the training performance. How to address these problems from a network perspective that is decoupled from the FL algorithm remains an open challenge. In this paper, we propose FedCod, a new application layer communication protocol designed for cross-silo FL. FedCod transparently utilizes a coding mechanism to enhance the efficient use of idle bandwidth through client-to-client communication, and dynamically adjusts coding redundancy to mitigate network bottlenecks and fluctuations, thereby improving the communication efficiency and accelerating the training process. In our real-world experiments, FedCod demonstrates a significant reduction in average communication time by up to 62% compared to the baseline, while maintaining FL training performance and optimizing inter-client communication traffic.
Problem

Research questions and friction points this paper is trying to address.

Federated Learning
Network Efficiency
Cross-Regional Learning
Innovation

Methods, ideas, or system contributions that make the work stand out.

FedCod
Coding Techniques
Efficient Communication
🔎 Similar Papers
No similar papers found.
Peishen Yan
Peishen Yan
Shanghai Jiao Tong University
Federated LearningLLM Fine-Tuning
J
Jun Li
City University of New York, New York, NY, USA
H
Hao Wang
Stevens Institute of Technology, Hoboken, USA
T
Tao Song
Shanghai Jiao Tong University, Shanghai, China
Y
Yang Hua
Queen’s University Belfast, Belfast, UK
L
Lu Peng
Tulane University, New Orleans, LA, USA
H
Haihui Zhou
Shanghai Jiao Tong University, Shanghai, China
H
Haibing Guan
Shanghai Jiao Tong University, Shanghai, China