CCFC: Bridging Federated Clustering and Contrastive Learning

πŸ“… 2024-01-12
πŸ›οΈ arXiv.org
πŸ“ˆ Citations: 3
✨ Influential: 0
πŸ“„ PDF

career value

236K/year
πŸ€– AI Summary
To address the challenge of unsupervised clustering of high-dimensional data in federated learning, this paper proposes FedClusterβ€”the first federated clustering framework integrating contrastive representation learning. Methodologically, it introduces cluster-level contrastive loss to align global cluster structures across clients, and designs a robust local cluster center alignment mechanism coupled with adaptive aggregation to mitigate client heterogeneity and dropouts. The framework strictly preserves data privacy by confining raw data to local devices, exchanging only model parameters and cluster centroids. Evaluated on multiple benchmark datasets, FedCluster achieves a maximum NMI improvement of 0.4155 and outperforms the best baseline by over 100% in clustering accuracy. Notably, it maintains stable performance even under 30% client dropout, demonstrating strong resilience to client unavailability.

Technology Category

Application Category

πŸ“ Abstract
Federated clustering, an essential extension of centralized clustering for federated scenarios, enables multiple data-holding clients to collaboratively group data while keeping their data locally. In centralized scenarios, clustering driven by representation learning has made significant advancements in handling high-dimensional complex data. However, the combination of federated clustering and representation learning remains underexplored. To bridge this, we first tailor a cluster-contrastive model for learning clustering-friendly representations. Then, we harness this model as the foundation for proposing a new federated clustering method, named cluster-contrastive federated clustering (CCFC). Benefiting from representation learning, the clustering performance of CCFC even double those of the best baseline methods in some cases. Compared to the most related baseline, the benefit results in substantial NMI score improvements of up to 0.4155 on the most conspicuous case. Moreover, CCFC also shows superior performance in handling device failures from a practical viewpoint.
Problem

Research questions and friction points this paper is trying to address.

Combining federated clustering with representation learning
Improving clustering performance in federated scenarios
Enhancing robustness against device failures
Innovation

Methods, ideas, or system contributions that make the work stand out.

Cluster-contrastive model for representation learning
New federated clustering method CCFC
Handles device failures with superior performance
πŸ”Ž Similar Papers
2024-10-04IEEE International Symposium on Network Computing and ApplicationsCitations: 3