๐ค AI Summary
Over-the-air (OTA) federated learning for energy-harvesting devices suffers from convergence bias and communication redundancy due to channel fading, energy constraints, and data heterogeneity. To address these challenges, we propose an entropy-driven dual-paradigm scheduling framework: entropy-optimized scheduling for known data distributions, andโwhen distributions are unknownโa least-squares-based user representation model enabling adaptive client selection, thereby pioneering the integration of diversity-awareness into energy-efficient collaborative updating. Our framework jointly optimizes OTA aggregation, fine-grained energy harvesting modeling, and fading channel adaptation. Theoretical analysis guarantees convergence, while experiments demonstrate significant improvements over baseline methods: 32% reduction in communication rounds, 27% lower energy consumption, and a 4.8% gain in global model accuracy. Moreover, the approach exhibits enhanced robustness against data skewness and channel fluctuations.
๐ Abstract
We study over-the-air (OTA) federated learning (FL) for energy harvesting devices with heterogeneous data distribution over wireless fading multiple access channel (MAC). To address the impact of low energy arrivals and data heterogeneity on global learning, we propose user scheduling strategies. Specifically, we develop two approaches: 1) entropy-based scheduling for known data distributions and 2) least-squares-based user representation estimation for scheduling with unknown data distributions at the parameter server. Both methods aim to select diverse users, mitigating bias and enhancing convergence. Numerical and analytical results demonstrate improved learning performance by reducing redundancy and conserving energy.