Accelerating Locality-Driven Integration in Quantum Chemistry with Block-Structured Matrix Multiplication

📅 2026-05-11
📈 Citations: 0
Influential: 0
📄 PDF

career value

221K/year
🤖 AI Summary
This work addresses the challenge of efficiently computing locality-driven integrals in quantum chemistry, whose matrix representations exhibit dynamic structural sparsity that is poorly handled by existing dense or generic sparse GPU algorithms. To overcome this, the authors propose KerneLDI, a GPU-oriented block-structured matrix multiplication framework that translates spatial locality into highly parallel block computations through a unified block filtering representation, custom dense block multiplication kernels, and co-optimized data layouts. Integrated with a multi-GPU parallelization strategy, KerneLDI achieves up to 10× acceleration in exchange-correlation energy evaluation while preserving numerical accuracy, significantly speeds up end-to-end self-consistent field iterations, and enhances ab initio molecular dynamics throughput by nearly sixfold.
📝 Abstract
Locality-driven integration is a pervasive computational pattern in quantum chemistry, arising whenever spatially localized basis functions interact through numerical quadrature or integral screening. The dominant matrix multiplications in these tasks exhibit dynamic, structured sparsity driven by spatial locality, posing significant challenges for both dense batched kernels and generic sparse formats on GPUs. We present KerneLDI, a GPU-oriented framework that addresses this regime by co-designing data layout, screening logic, and matrix-computation operators to realize block-structured matrix multiplication for locality-driven integration. KerneLDI reorganizes operand matrices into a unified block-filtered representation that retains only spatially relevant blocks, and executes the resulting contractions with customized dense block multipliers that adapt proven dense-matmul optimizations to retained block pairs. We develop and evaluate KerneLDI on exchange--correlation (EXC) integration in Kohn--Sham density functional theory, a representative and computationally critical instance of this pattern. Across diverse molecular systems, KerneLDI preserves numerical accuracy while delivering up to 10$\times$ speedup for EXC evaluation over a dense GPU baseline, scales favorably with increasing system size and multi-GPU parallelism, accelerates end-to-end self-consistent field calculations, and yields nearly 6$\times$ throughput improvement for ab initio molecular dynamics.
Problem

Research questions and friction points this paper is trying to address.

locality-driven integration
structured sparsity
quantum chemistry
matrix multiplication
GPU acceleration
Innovation

Methods, ideas, or system contributions that make the work stand out.

block-structured matrix multiplication
locality-driven integration
GPU acceleration
quantum chemistry
structured sparsity
🔎 Similar Papers
No similar papers found.