๐ค AI Summary
Traditional hypergraph incidence matrices employ binary representations, which struggle to capture the complex higher-order relationships between nodes and hyperedges arising from their size disparities. This work proposes the first continuous nodeโhyperedge proximity matrix grounded in a resource allocation mechanism, overcoming the limitations of binary encoding and enabling a more refined modeling of higher-order interactions. The resulting proximity matrix can be seamlessly integrated into lightweight algorithms to effectively support downstream tasks such as link prediction, key node identification, and community detection. Extensive experiments on multiple real-world hypergraph datasets demonstrate that methods leveraging this matrix consistently outperform existing baselines across all three core tasks, confirming both its expressive power and practical utility.
๐ Abstract
Hypergraphs serve as an effective tool widely adopted to characterize higher-order interactions in complex systems. The most intuitive and commonly used mathematical instrument for representing a hypergraph is the incidence matrix, in which each entry is binary, indicating whether the corresponding node belongs to the corresponding hyperedge. Although the incidence matrix has become a foundational tool for hypergraph analysis and mining, we argue that its binary nature is insufficient to accurately capture the complexity of node-hyperedge relationships arising from the fact that different hyperedges can contain vastly different numbers of nodes. Accordingly, based on the resource allocation process on hypergraphs, we propose a continuous-valued matrix to quantify the proximity between nodes and hyperedges. To verify the effectiveness of the proposed proximity matrix, we investigate three important tasks in hypergraph mining: link prediction, vital nodes identification, and community detection. Experimental results on numerous real-world hypergraphs show that simply designed algorithms centered on the proximity matrix significantly outperform benchmark algorithms across these three tasks.