FedPLT: Scalable, Resource-Efficient, and Heterogeneity-Aware Federated Learning via Partial Layer Training

📅 2026-05-04

📈 Citations: 0

✨ Influential: 0

career value

197K/year

🤖 AI Summary

This work proposes FedPLT, a novel federated learning approach designed to address the high communication and computational overhead, strong device heterogeneity, and issues of inconsistent parameter distributions and biased global loss estimation caused by existing partial-parameter training methods. FedPLT employs a structured partial-layer training strategy that adaptively assigns each client a personalized subset of the model based on its resource capacity. By integrating resource-aware model partitioning, hierarchical parameter selection, optimal client sampling, and aggregation optimization, FedPLT achieves performance on par with or superior to FedAvg while using only 18%–29% of trainable parameters. The method significantly reduces the number of straggler clients and demonstrates superior performance in highly heterogeneous environments compared to current state-of-the-art approaches.

📝 Abstract

Federated Learning (FL) has gained significant attention in distributed machine learning by enabling collaborative model training across decentralized system while preserving data privacy. Although extensive research has addressed statistical data heterogeneity, FL still faces several challenges, including high communication and computation overheads and severe device heterogeneity, which require further investigation. Prior work has addressed these issues through sub-model training and partial parameter training. However, such methods often suffer from inconsistent parameter distributions across clients, inaccurate global loss estimation, and increased bias and variance. Guided by our empirical analysis, we propose FedPLT (Federated Learning with Partial Layer Training), an innovative and structured partial parameter training approach that exhibits training behavior similar to full model training while assigning client-specific portions of the model according to their communication and computational capabilities. In addition, we evaluate the performance of FedPLT when combined with optimal client sampling under communication constraints. We show that this integration improves FL performance by reducing sampling variance under the same communication budget. Through extensive experiments, we demonstrate that FedPLT achieves performance comparable to, or even surpassing, that of full-model training (i.e., FedAvg), while requiring significantly fewer trainable parameters per client. Moreover, FedPLT outperforms existing methods in highly heterogeneous environments, effectively adapts to client resource constraints, and reduces the number of straggling clients. In particular, FedPLT reduces the number of trainable parameters by 71%-82% while achieving performance on par with full-model training.

Problem

Research questions and friction points this paper is trying to address.

Federated Learning

Device Heterogeneity

Communication Overhead

Computation Overhead

Partial Model Training

Innovation

Methods, ideas, or system contributions that make the work stand out.

Federated Learning

Partial Layer Training

Device Heterogeneity