Accelerating Maximal Biclique Enumeration on GPUs

πŸ“… 2024-01-10
πŸ›οΈ arXiv.org
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
The maximal biclique enumeration (MBE) problem on large-scale graphs suffers from high computational complexity and poor scalability. To address this, we propose cuMBEβ€”the first efficient GPU-parallel algorithm for MBE. Our method eliminates recursion entirely via a compact array-based data structure and synergistically combines coarse-grained task partitioning, fine-grained intra-thread optimizations, and dynamic work-stealing to mitigate load imbalance and memory overhead bottlenecks. By leveraging GPU parallelism and thread-block-level cooperative optimization, cuMBE achieves geometric mean speedups of 4.02Γ— and 4.13Γ— over the best sequential and multicore CPU algorithms, respectively, on both synthetic and real-world datasets. This advancement significantly enhances the scalability and practical applicability of MBE computation.

Technology Category

Application Category

πŸ“ Abstract
Maximal Biclique Enumeration (MBE) holds critical importance in graph theory with applications extending across fields such as bioinformatics, social networks, and recommendation systems. However, its computational complexity presents barriers for efficiently scaling to large graphs. To address these challenges, we introduce cuMBE, a GPU-optimized parallel algorithm for MBE. Utilizing a unique data structure, called compact array, cuMBE eradicates the need for recursion, thereby significantly minimizing dynamic memory requirements and computational overhead. The algorithm utilizes a hybrid parallelism approach, in which GPU thread blocks handle coarse-grained tasks associated with part of the search process. Besides, we implement three fine-grained optimizations within each thread block to enhance performance. Further, we integrate a work-stealing mechanism to mitigate workload imbalances among thread blocks. Our experiments reveal that cuMBE achieves an geometric mean speedup of 4.02x and 4.13x compared to the state-of-the-art serial algorithm and parallel CPU-based algorithm on both common and real-world datasets, respectively.
Problem

Research questions and friction points this paper is trying to address.

Accelerating Maximal Biclique Enumeration for large graphs
Reducing computational complexity with GPU-optimized parallel algorithm
Addressing workload imbalances in parallel processing for MBE
Innovation

Methods, ideas, or system contributions that make the work stand out.

GPU-optimized parallel algorithm for MBE
Compact array eliminates recursion and memory overhead
Hybrid parallelism with work-stealing for load balance
πŸ”Ž Similar Papers
No similar papers found.
C
Chou-Ying Hsieh
Department of Electrical Engineering, National Taiwan University, Taipei, Taiwan
C
Chia-Ming Chang
Department of Electrical Engineering, National Taiwan University, Taipei, Taiwan
P
Po-Hsiu Cheng
Graduate Institute of Electronic Engineering, National Taiwan University, Taipei, Taiwan
Sy-Yen Kuo
Sy-Yen Kuo
National Taiwan University