A Clustering Method with Graph Maximum Decoding Information

📅 2024-03-18
🏛️ IEEE International Joint Conference on Neural Network
📈 Citations: 0
Influential: 0
📄 PDF

career value

227K/year
🤖 AI Summary
Existing graph clustering methods struggle to effectively model node visit uncertainty inherent in random walks and underutilize structural information. To address this, we propose CMDI, a novel framework that reformulates graph clustering as an abstract clustering task aimed at maximizing decoding information (MDI), comprising two sequential phases: graph structure extraction and vertex partitioning. CMDI introduces, for the first time, a two-dimensional structural information theory to explicitly characterize visit uncertainty and integrate prior knowledge. It jointly leverages graph neural networks (GNNs), probabilistic analysis of random walks, and structural information entropy to construct an end-to-end MDI optimization framework. Evaluated on three real-world datasets, CMDI achieves statistically significant improvements over classical baselines in the DI-R metric, demonstrating both high-quality decoding information and computational efficiency—particularly when prior knowledge is incorporated.

Technology Category

Application Category

📝 Abstract
The clustering method based on graph models has garnered increased attention for its widespread applicability across various knowledge domains. Its adaptability to integrate seamlessly with other relevant applications endows the graph model-based clustering analysis with the ability to robustly extract "natural associations" or "graph structures" within datasets, facilitating the modelling of relationships between data points. Despite its efficacy, the current clustering method utilizing the graph-based model overlooks the uncertainty associated with random walk access between nodes and the embedded structural information in the data. To address this gap, we present a novel Clustering method for Maximizing Decoding Information within graph-based models, named CMDI. CMDI innovatively incorporates two-dimensional structural information theory into the clustering process, consisting of two phases: graph structure extraction and graph vertex partitioning. Within CMDI, graph partitioning is reformulated as an abstract clustering problem, leveraging maximum decoding information to minimize uncertainty associated with random visits to vertices. Empirical evaluations on three real-world datasets demonstrate that CMDI outperforms classical baseline methods, exhibiting a superior decoding information ratio (DI-R). Furthermore, CMDI showcases heightened efficiency, particularly when considering prior knowledge (PK). These findings underscore the effectiveness of CMDI in enhancing decoding information quality and computational efficiency, positioning it as a valuable tool in graph-based clustering analyses.
Problem

Research questions and friction points this paper is trying to address.

Improves graph-based clustering by addressing uncertainty in random walks
Incorporates structural information theory to enhance decoding information
Optimizes graph partitioning to minimize vertex visit uncertainty
Innovation

Methods, ideas, or system contributions that make the work stand out.

Incorporates two-dimensional structural information theory
Reformulates graph partitioning as clustering problem
Maximizes decoding information to minimize uncertainty
🔎 Similar Papers
No similar papers found.
X
Xinrun Xu
University of Chinese Academy of Sciences, Beijing, China; Institute of Software, Chinese Academy of Sciences, Beijing, China
M
Manying Lv
University of Chinese Academy of Sciences, Beijing, China; Institute of Software, Chinese Academy of Sciences, Beijing, China
Y
Yurong Wu
University of Chinese Academy of Sciences, Beijing, China; Institute of Software, Chinese Academy of Sciences, Beijing, China
Z
Zhanbiao Lian
University of Chinese Academy of Sciences, Beijing, China; Institute of Software, Chinese Academy of Sciences, Beijing, China
J
Jin Yan
Advanced Institute of Big Data (Beijing), Beijing, China
S
Shan Jiang
University of Chinese Academy of Sciences, Beijing, China; Institute of Software, Chinese Academy of Sciences, Beijing, China
Z
Zhiming Ding
University of Chinese Academy of Sciences, Beijing, China; Institute of Software, Chinese Academy of Sciences, Beijing, China