A Short Note on Batch-efficient Divide-and-Conquer Algorithm for EigenDecomposition

πŸ“… 2026-04-29
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF

career value

214K/year
πŸ€– AI Summary
This work addresses the high computational cost of eigendecomposition for small-batch, low-dimensional (dimension < 64) matrices in deep neural networks. To overcome the limitation of existing QR-based algorithms, which are typically restricted to very small matrices (dimension < 32), the authors propose a GPU-optimized batched divide-and-conquer eigendecomposition algorithm tailored for parallel architectures. By leveraging an efficient divide-and-conquer strategy, the method significantly enhances throughput for batch processing while maintaining numerical stability. Experimental results demonstrate that, for matrices with dimensions below 64, the proposed approach achieves substantially faster computation than PyTorch’s built-in SVD implementation without compromising accuracy.
πŸ“ Abstract
EigenDecomposition (ED) is at the heart of many computer vision algorithms and applications. One crucial bottleneck limiting its usage is the expensive computation cost, particularly for a mini-batch of matrices in deep neural networks. Our previous work proposed a dedicated QR-based ED algorithm for batched small matrices (dim${<}32$). This short paper targets the limitation and proposes a batch-efficient Divide-and-Conquer based ED algorithm for larger matrices. The numerical test shows that for a mini-batch of matrices whose dimensions are smaller than $64$, our method can be much faster than the Pytorch SVD function.
Problem

Research questions and friction points this paper is trying to address.

EigenDecomposition
batch processing
computation cost
large matrices
deep neural networks
Innovation

Methods, ideas, or system contributions that make the work stand out.

Batch-efficient
Divide-and-Conquer
EigenDecomposition
Small-to-medium matrices
GPU acceleration
πŸ”Ž Similar Papers