๐ค AI Summary
Deploying large-scale AI models via federated learning (FL) on resource-constrained edge devices remains challenging; existing split FL (SFL) approaches are largely confined to two-tier architectures and fail to exploit heterogeneous multi-tier cloudโedgeโdevice resources. Method: We propose Hierarchical Split Federated Learning (HSFL), the first framework supporting flexible, heterogeneous multi-level system deployments. Theoretically, we derive the first convergence upper bound for HSFL. Methodologically, we jointly optimize model partitioning locations and multi-tier aggregation strategies, and design a decomposable iterative descent algorithm to achieve Pareto-optimal trade-offs among accuracy, communication, and computation overhead. Results: Extensive simulations demonstrate that HSFL significantly improves convergence speed and model accuracy across arbitrary multi-tier topologies, while achieving superior communication and computational resource utilization compared to state-of-the-art SFL methods.
๐ Abstract
As AI models expand in size, it has become increasingly challenging to deploy federated learning (FL) on resource-constrained edge devices. To tackle this issue, split federated learning (SFL) has emerged as an FL framework with reduced workload on edge devices via model splitting; it has received extensive attention from the research community in recent years. Nevertheless, most prior works on SFL focus only on a two-tier architecture without harnessing multi-tier cloudedge computing resources. In this paper, we intend to analyze and optimize the learning performance of SFL under multi-tier systems. Specifically, we propose the hierarchical SFL (HSFL) framework and derive its convergence bound. Based on the theoretical results, we formulate a joint optimization problem for model splitting (MS) and model aggregation (MA). To solve this rather hard problem, we then decompose it into MS and MA subproblems that can be solved via an iterative descending algorithm. Simulation results demonstrate that the tailored algorithm can effectively optimize MS and MA for SFL within virtually any multi-tier system.