π€ AI Summary
To address client data heterogeneity, energy constraints, and dynamic disconnections in federated learning for IoT time-series forecasting, this paper proposes FedDeCABβa semi-decentralized client selection and collaborative optimization framework. Its core innovations include: (i) a probabilistic ranking mechanism for efficient client selection, and (ii) localized model sharing and joint optimization among neighboring clients upon disconnection, thereby mitigating the adverse impact of availability fluctuations on convergence. FedDeCAB maintains low communication overhead while significantly improving global model convergence speed and generalization performance. Extensive experiments on large-scale taxi and vessel trajectory datasets demonstrate that, under highly heterogeneous and dynamic settings, FedDeCAB reduces communication cost by 32.7% on average and accelerates convergence by 1.8Γ compared to baseline methods, validating its robustness and practicality for real-world IoT deployments.
π Abstract
Federated learning (FL) effectively promotes collaborative training among distributed clients with privacy considerations in the Internet of Things (IoT) scenarios. Despite of data heterogeneity, FL clients may also be constrained by limited energy and availability budgets. Therefore, effective selection of clients participating in training is of vital importance for the convergence of the global model and the balance of client contributions. In this paper, we discuss the performance impact of client availability with time-series data on federated learning. We set up three different scenarios that affect the availability of time-series data and propose FedDeCAB, a novel, semi-decentralized client selection method applying probabilistic rankings of available clients. When a client is disconnected from the server, FedDeCAB allows obtaining partial model parameters from the nearest neighbor clients for joint optimization, improving the performance of offline models and reducing communication overhead. Experiments based on real-world large-scale taxi and vessel trajectory datasets show that FedDeCAB is effective under highly heterogeneous data distribution, limited communication budget, and dynamic client offline or rejoining.