Attributed Graph Clustering in Collaborative Settings

📅 2024-11-19
🏛️ IEEE Transactions on Dependable and Secure Computing
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address low clustering efficiency and poor collaboration under data silos, this paper proposes a vertically partitioned attributed graph collaborative clustering framework: each party holds only a subset of node attributes and collaboratively performs clustering without sharing raw data, exchanging only gradients or embedding summaries. We innovatively design a sample-space-reduction-based collaborative optimization mechanism and, for the first time, theoretically prove—under a proximity condition—that local convergence guarantees global convergence. The method integrates graph neural networks, distributed optimization, and privacy-preserving techniques. Evaluated on four public datasets, it achieves an average 1.2% improvement in normalized mutual information (NMI), matching centralized methods’ performance, while reducing communication overhead by 37% and accelerating training by 2.1×.

Technology Category

Application Category

📝 Abstract
Graph clustering is an unsupervised machine learning method that partitions the nodes in a graph into different groups. Despite achieving significant progress in exploiting both attributed and structured data information, graph clustering methods often face practical challenges related to data isolation. Moreover, the absence of collaborative methods for graph clustering limits their effectiveness. In this paper, we propose a collaborative graph clustering framework for attributed graphs, supporting attributed graph clustering over vertically partitioned data with different participants holding distinct features of the same data. Our method leverages a novel technique that reduces the sample space, improving the efficiency of the attributed graph clustering method. Furthermore, we compare our method to its centralized counterpart under a proximity condition, demonstrating that the successful local results of each participant contribute to the overall success of the collaboration. We fully implement our approach and evaluate its utility and efficiency by conducting experiments on four public datasets. The results demonstrate that our method achieves comparable accuracy levels to centralized attributed graph clustering methods. Our collaborative graph clustering framework provides an efficient and effective solution for graph clustering challenges related to data isolation.
Problem

Research questions and friction points this paper is trying to address.

Distributed Data
Graph Clustering
Collaborative Methods
Innovation

Methods, ideas, or system contributions that make the work stand out.

Distributed Graph Clustering
Efficiency Enhancement
Data Dispersal Problem
🔎 Similar Papers
No similar papers found.
R
Rui Zhang
School of Mathematical Sciences, Zhejiang University, Hang Zhou, China
Xiaoyang Hou
Xiaoyang Hou
Zhejiang University
Zhihua Tian
Zhihua Tian
Zhejiang Univrsity
Truthworth AI Generative Model Federated Learning
Y
Yan He
AntGroup, Hangzhou, China
E
Enchao Gong
AntGroup, Hangzhou, China
J
Jian Liu
School of Cyber Science and Technology, Zhejiang University, Hang Zhou, China
Q
Qing Wu
School of Mathematical Sciences, Zhejiang University, Hang Zhou, China
Kui Ren
Kui Ren
Professor and Dean of Computer Science, Zhejiang University, ACM/IEEE Fellow
Data Security & PrivacyAI SecurityIoT & Vehicular Security