🤖 AI Summary
Binary and ternary neural network (BNN/TNN) inference on RRAM-based compute-in-memory (CIM) architectures is hindered by device non-idealities, while existing toolchains are fragmented—supporting only compilation, simulation, or design-space exploration (DSE) in isolation—and rely on 8-bit quantization, limiting accuracy and efficiency.
Method: This work introduces the first modular CIM toolbox enabling joint DSE for BNNs and TNNs. It integrates RRAM physics-aware modeling, sparsity-aware block-transpose mapping, a customized compiler stack, and an accuracy–hardware co-simulator to unify weight mapping, array mapping, and accuracy evaluation in an end-to-end optimized flow.
Contribution/Results: The toolbox overcomes siloed optimization by enabling rapid, cross-parameter accuracy estimation—spanning array sizes, cell variation rates, and mapping strategies—thereby accelerating the full pipeline from early-stage evaluation to chip deployment. Open-source implementation is provided.
📝 Abstract
Using Resistive Random Access Memory (RRAM) crossbars in Computing-in-Memory (CIM) architectures offers a promising solution to overcome the von Neumann bottleneck. Due to non-idealities like cell variability, RRAM crossbars are often operated in binary mode, utilizing only two states: Low Resistive State (LRS) and High Resistive State (HRS). Binary Neural Networks (BNNs) and Ternary Neural Networks (TNNs) are well-suited for this hardware due to their efficient mapping. Existing software projects for RRAM-based CIM typically focus on only one aspect: compilation, simulation, or Design Space Exploration (DSE). Moreover, they often rely on classical 8 bit quantization. To address these limitations, we introduce CIM-Explorer, a modular toolkit for optimizing BNN and TNN inference on RRAM crossbars. CIM-Explorer includes an end-to-end compiler stack, multiple mapping options, and simulators, enabling a DSE flow for accuracy estimation across different crossbar parameters and mappings. CIM-Explorer can accompany the entire design process, from early accuracy estimation for specific crossbar parameters, to selecting an appropriate mapping, and compiling BNNs and TNNs for a finalized crossbar chip. In DSE case studies, we demonstrate the expected accuracy for various mappings and crossbar parameters. CIM-Explorer can be found on GitHub.