One-Shot Federated Clustering of Non-Independent Completely Distributed Data

๐Ÿ“… 2026-01-24
๐Ÿ›๏ธ IEEE Internet of Things Journal
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This work addresses the performance bottleneck in unsupervised federated clustering caused by data fragmentation among non-independent and identically distributed (Non-IID) clients, and introduces a more general challenge termed โ€œnon-independent completely distributedโ€ (Non-ICD). To tackle this, the authors propose GOLD, the first single-round communication federated clustering framework, which models incomplete local cluster distributions, uploads lightweight distribution summaries to the server, fuses them into a global distribution, and uses this global view to guide local clustering refinement. Experimental results demonstrate that GOLD significantly outperforms existing methods across diverse Non-ICD settings, while maintaining high efficiency, scalability, and strong privacy preservation.

Technology Category

Application Category

๐Ÿ“ Abstract
Federated Learning (FL) that extracts data knowledge while protecting the privacy of multiple clients has achieved remarkable results in distributed privacy-preserving IoT systems, including smart traffic flow monitoring, smart grid load balancing, and so on. Since most data collected from edge devices are unlabeled, unsupervised Federated Clustering (FC) is becoming increasingly popular for exploring pattern knowledge from complex distributed data. However, due to the lack of label guidance, the common Non-Independent and Identically Distributed (Non-IID) issue of clients have greatly challenged FC by posing the following problems: How to fuse pattern knowledge (i.e., cluster distribution) from Non-IID clients; How are the cluster distributions among clients related; and How does this relationship connect with the global knowledge fusion? In this paper, a more tricky but overlooked phenomenon in Non-IID is revealed, which bottlenecks the clustering performance of the existing FC approaches. That is, different clients could fragment a cluster, and accordingly, a more generalized Non-IID concept, i.e., Non-ICD (Non-Independent Completely Distributed), is derived. To tackle the above FC challenges, a new framework named GOLD (Global Oriented Local Distribution Learning) is proposed. GOLD first finely explores the potential incomplete local cluster distributions of clients, then uploads the distribution summarization to the server for global fusion, and finally performs local cluster enhancement under the guidance of the global distribution. Extensive experiments, including significance tests, ablation studies, scalability evaluations, qualitative results, etc., have been conducted to show the superiority of GOLD.
Problem

Research questions and friction points this paper is trying to address.

Federated Clustering
Non-IID
Non-ICD
Unsupervised Learning
Data Heterogeneity
Innovation

Methods, ideas, or system contributions that make the work stand out.

Federated Clustering
Non-ICD
GOLD
One-Shot Learning
Unsupervised Federated Learning
๐Ÿ”Ž Similar Papers
No similar papers found.
Yiqun Zhang
Yiqun Zhang
Northeastern University, China; Shanghai Artificial Intelligence Laboratory
empathetic dialogueLLM-based agentMulti-agent
S
Shenghong Cai
Department of Computer Science, Beijing Normal-Hong Kong Baptist University, Zhuhai 519087, Guangdong, China
Z
Zihua Yang
School of Computer Science and Technology, Guangdong University of Technology, Guangzhou 510006, China
S
Sen Feng
School of Computer Science and Technology, Guangdong University of Technology, Guangzhou 510006, China
Y
Yuzhu Ji
School of Computer Science and Technology, Guangdong University of Technology, Guangzhou 510006, China
Haijun Zhang
Haijun Zhang
Professor, IEEE Fellow, University of Science and Technology Beijing
6GAI enabled Wireless CommunicationsResource AllocationMobility Management