π€ AI Summary
To address inefficient training in multi-device federated learning caused by resource-constrained edge devices, this paper proposes Centaurβa novel framework that introduces an on-demand client-side data selection mechanism at the edge and pioneers a coupled paradigm of data selection and model sharding for single-user, multi-device scenarios. Centaur jointly optimizes challenges arising from non-IID data distributions, device heterogeneity, and dynamic mobility. It enables collaborative deep neural network (DNN) model sharding across lightweight and resource-rich devices, significantly improving resource utilization. Extensive experiments across five neural architectures and six benchmark datasets demonstrate an average 19% improvement in classification accuracy and a 58% reduction in federated training latency. The implementation is open-sourced to advance research in decentralized federated learning.
π Abstract
Ubiquitous wearable and mobile devices provide access to a diverse set of data. However, the mobility demand for our devices naturally imposes constraints on their computational and communication capabilities. A solution is to locally learn knowledge from data captured by ubiquitous devices, rather than to store and transmit the data in its original form. In this paper, we develop a federated learning framework, called Centaur, to incorporate on-device data selection at the edge, which allows partition-based training of a deep neural nets through collaboration between constrained and resourceful devices within the multidevice ecosystem of the same user. We benchmark on five neural net architecture and six datasets that include image data and wearable sensor time series. On average, Centaur achieves ~19% higher classification accuracy and ~58% lower federated training latency, compared to the baseline. We also evaluate Centaur when dealing with imbalanced non-iid data, client participation heterogeneity, and different mobility patterns. To encourage further research in this area, we release our code at https://github.com/nokia-bell-labs/data-centric-federated-learning