🤖 AI Summary
To address bandwidth and energy-efficiency bottlenecks of the MTTKRP (Matricized Tensor Times Khatri–Rao Product) kernel in in-memory computing, this work proposes a novel photonic-in-memory architecture based on photonic static random-access memory (pSRAM). Methodologically, we first design a scalable pSRAM array integrated with wavelength-division multiplexing (WDM) to enable hyperspectral parallel computation, and develop a hardware-aware MTTKRP mapping strategy alongside the first sustained-performance prediction model for pSRAM. Our contributions are threefold: (1) a WDM-enhanced hyperspectral operation mechanism; (2) a comprehensive pSRAM performance modeling framework tailored to MTTKRP; and (3) achieving 17 PetaOps of sustained computational throughput under realistic hardware configurations—significantly outperforming state-of-the-art electronic in-memory computing systems.
📝 Abstract
Photonics-based in-memory computing systems have demonstrated a significant speedup over traditional transistor-based systems because of their ultra-fast operating frequencies and high data bandwidths. Photonic static random access memory (pSRAM) is a crucial component for achieving the objective of ultra-fast photonic in-memory computing systems. In this work, we model and evaluate the performance of a novel photonic SRAM array architecture in development. Additionally, we examine hyperspectral operation through wavelength division multiplexing (WDM) to enhance the throughput of the pSRAM array. We map Matricized Tensor Times Khatri-Rao Product (MTTKRP), a computational kernel commonly used in tensor decomposition, to the proposed pSRAM array architecture. We also develop a predictive performance model to estimate the sustained performance of different configurations of the pSRAM array. Using the predictive performance model, we demonstrate that the pSRAM array achieves 17 PetaOps while performing MTTKRP in a practical hardware configuration.