gMatch: Fine-Grained and Hardware-Efficient Subgraph Matching on GPUs

📅 2026-04-12
📈 Citations: 0
Influential: 0
📄 PDF

career value

224K/year
🤖 AI Summary
Existing GPU-based subgraph matching approaches are hindered by coarse-grained execution models, resulting in high memory overhead, low thread utilization, and poor scalability. This work proposes a fine-grained GPU subgraph matching method that introduces warp-level batch exploration, lightweight dynamic load balancing, and fine-grained task scheduling to significantly improve hardware utilization and reduce memory consumption. Experimental results demonstrate that the proposed approach consistently outperforms state-of-the-art methods—including STMatch, T-DFS, and EGSM—across diverse real-world datasets and query patterns, while exhibiting superior capability in handling large-scale queries and data volumes.

Technology Category

Application Category

📝 Abstract
Subgraph matching is a core operation in graph analytics, supporting a broad spectrum of applications from social network analysis to bioinformatics. Recent GPU-based approaches accelerate subgraph matching by leveraging parallelism but rely on a coarse-grained execution model that suffers from scalability and efficiency issues due to high memory overhead and thread underutilization. In this paper, we propose gMatch, a hardware-efficient subgraph matching approach on GPUs. gMatch introduces a fine-grained execution model that reduces memory consumption and enables flexible task scheduling among threads. We further design warp-level batch exploration and lightweight load balancing to improve execution efficiency and scalability. Experiments on diverse workloads and real-world datasets show that gMatch outperforms state-of-the-art subgraph matching methods, including STMatch, T-DFS, and EGSM, in both performance and scalability. We also compare against state-of-the-art systems for mining small patterns, such as BEEP and G$^2$Miner. While these systems achieve better performance on small datasets, gMatch scales to substantially larger queries and datasets, where existing approaches degrade or fail to complete.
Problem

Research questions and friction points this paper is trying to address.

subgraph matching
GPU
scalability
memory overhead
thread underutilization
Innovation

Methods, ideas, or system contributions that make the work stand out.

fine-grained execution
GPU acceleration
subgraph matching
warp-level batch exploration
load balancing
🔎 Similar Papers
2023-09-27Proceedings of the VLDB EndowmentCitations: 21