🤖 AI Summary
ADMM-FFT tomographic reconstruction suffers from high computational cost and substantial memory overhead. To address this, we propose memoized low-rank reconstruction (mLR), the first approach to integrate memoization into the ADMM-FFT iterative framework by caching repeated FFT computation results. mLR further incorporates cross-GPU variable offloading, hierarchical CPU memory management, and multi-node, multi-GPU parallelization to significantly improve memory efficiency and scalability. The method supports reconstructions up to 2K×2K×2K voxels and operates efficiently under memory constraints. Experimental evaluation demonstrates that mLR achieves an average speedup of 52.8% over baseline ADMM-FFT, with a maximum acceleration of 65.4%, while preserving reconstruction accuracy and exhibiting strong scalability across diverse hardware configurations.
📝 Abstract
ADMM-FFT is an iterative method with high reconstruction accuracy for laminography, but it suffers from excessive computation time and large memory consumption. We introduce mLR, which employs memoization to replace time-consuming Fast Fourier Transform (FFT) operations, based on a unique observation that similar FFT operations recur across iterations of ADMM-FFT. We introduce a series of techniques that make the application of memoization to ADMM-FFT performance-beneficial and scalable. We also introduce variable offloading to save CPU memory and to scale ADMM-FFT across GPUs within and across nodes. Using mLR, we are able to scale ADMM-FFT to an input problem of 2K×2K×2K, the largest input problem ever handled by an ADMM-FFT solution to laminography reconstruction under limited memory; mLR brings a 52.8% performance improvement on average (up to 65.4%) compared to the original ADMM-FFT.