CCFC: Bridging Federated Clustering and Contrastive Learning

๐Ÿ“… 2024-01-12
๐Ÿ›๏ธ arXiv.org
๐Ÿ“ˆ Citations: 3
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
To address the challenge of unsupervised clustering of high-dimensional data in federated learning, this paper proposes FedClusterโ€”the first federated clustering framework integrating contrastive representation learning. Methodologically, it introduces cluster-level contrastive loss to align global cluster structures across clients, and designs a robust local cluster center alignment mechanism coupled with adaptive aggregation to mitigate client heterogeneity and dropouts. The framework strictly preserves data privacy by confining raw data to local devices, exchanging only model parameters and cluster centroids. Evaluated on multiple benchmark datasets, FedCluster achieves a maximum NMI improvement of 0.4155 and outperforms the best baseline by over 100% in clustering accuracy. Notably, it maintains stable performance even under 30% client dropout, demonstrating strong resilience to client unavailability.

Technology Category

Application Category

๐Ÿ“ Abstract
Federated clustering, an essential extension of centralized clustering for federated scenarios, enables multiple data-holding clients to collaboratively group data while keeping their data locally. In centralized scenarios, clustering driven by representation learning has made significant advancements in handling high-dimensional complex data. However, the combination of federated clustering and representation learning remains underexplored. To bridge this, we first tailor a cluster-contrastive model for learning clustering-friendly representations. Then, we harness this model as the foundation for proposing a new federated clustering method, named cluster-contrastive federated clustering (CCFC). Benefiting from representation learning, the clustering performance of CCFC even double those of the best baseline methods in some cases. Compared to the most related baseline, the benefit results in substantial NMI score improvements of up to 0.4155 on the most conspicuous case. Moreover, CCFC also shows superior performance in handling device failures from a practical viewpoint.
Problem

Research questions and friction points this paper is trying to address.

Combining federated clustering with representation learning
Improving clustering performance in federated scenarios
Enhancing robustness against device failures
Innovation

Methods, ideas, or system contributions that make the work stand out.

Cluster-contrastive model for representation learning
New federated clustering method CCFC
Handles device failures with superior performance
๐Ÿ”Ž Similar Papers
No similar papers found.
Jie Yan
Jie Yan
jieyan@amss.ac.cn
deep generative modelsclustering
J
Jing Liu
Central University of Finance and Economics, P.R.China
Z
Zhonghan Zhang
Central University of Finance and Economics, P.R.China