Efficient Partition-based Approaches for Diversified Top-k Subgraph Matching

๐Ÿ“… 2025-11-24
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
Existing top-k diversified subgraph matching methods prioritize vertex coverage but often yield locally clustered results with insufficient topological diversity. This paper formalizes the Distance-Diversified Top-k Subgraph Matching (DTkSM) problem: selecting k pairwise isomorphic matches that maximize topological distance to enhance global structural coverage. To solve it, we propose a partitioned adjacency graph modeling frameworkโ€”the first to incorporate pairwise topological distance as a core diversification criterion. Our approach integrates embedding-driven partition filtering, densest-subgraph-first candidate selection, and distance-diversity optimization, accelerated by efficient subgraph isomorphism checking and partition indexing. Evaluated on 12 real-world datasets, our method achieves up to four orders-of-magnitude speedup over state-of-the-art baselines; 95% of its outputs attain at least 80% of the optimal distance diversity, significantly improving both topological and coverage diversity.

Technology Category

Application Category

๐Ÿ“ Abstract
Subgraph matching is a core task in graph analytics, widely used in domains such as biology, finance, and social networks. Existing top-k diversified methods typically focus on maximizing vertex coverage, but often return results in the same region, limiting topological diversity. We propose the Distance-Diversified Top-k Subgraph Matching (DTkSM) problem, which selects k isomorphic matches with maximal pairwise topological distances to better capture global graph structure. To address its computational challenges, we introduce the Partition-based Distance Diversity (PDD) framework, which partitions the graph and retrieves diverse matches from distant regions. To enhance efficiency, we develop two optimizations: embedding-driven partition filtering and densest-based partition selection over a Partition Adjacency Graph. Experiments on 12 real world datasets show our approach achieves up to four orders of magnitude speedup over baselines, with 95% of results reaching 80% of optimal distance diversity and 100% coverage diversity.
Problem

Research questions and friction points this paper is trying to address.

Maximizes topological distance between subgraph matches for diversity
Addresses computational complexity through graph partitioning techniques
Improves efficiency while maintaining high coverage and distance diversity
Innovation

Methods, ideas, or system contributions that make the work stand out.

Partition-based framework for diverse subgraph matching
Embedding-driven partition filtering to enhance efficiency
Densest-based selection over Partition Adjacency Graph
๐Ÿ”Ž Similar Papers
No similar papers found.
L
Liuyi Chen
Hunan University, Changsha, China
Y
Yuchen Hu
Hunan University, Changsha, China
Z
Zhengyi Yang
The University of New South Wales, Sydney, Australia
X
Xu Zhou
Hunan University, Changsha, China
W
Wenjie Zhang
The University of New South Wales, Sydney, Australia
Kenli Li
Kenli Li
Cheung Kong Professor, Hunan University
High-performance ComputingParallel and Distributed ProcessingAI and Big Data