SynDCIM: A Performance-Aware Digital Computing-in-Memory Compiler with Multi-Spec-Oriented Subcircuit Synthesis

πŸ“… 2024-11-25
πŸ›οΈ arXiv.org
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Existing digital compute-in-memory (DCIM) macro designs rely on manual implementation, resulting in low design efficiency and difficulty in simultaneously optimizing power, area, and latencyβ€”thus limiting their performance potential. This paper proposes an agile compilation framework tailored for DCIM macros. It introduces a novel multi-specification-guided subcircuit synthesis algorithm that enables end-to-end automatic mapping from performance metrics to physical layout. A scalable, parameterized subcircuit library is constructed to support multi-objective co-optimization. The framework integrates analytical performance modeling, constraint-driven search, and automated layout generation. Implemented in 40 nm CMOS, the compiler achieves MAC throughput and energy efficiency on silicon comparable to state-of-the-art hand-designed macros, while significantly improving overall design quality. Experimental validation confirms both the practicality and advancement of the proposed methodology.

Technology Category

Application Category

πŸ“ Abstract
Digital Computing-in-Memory (DCIM) is an innovative technology that integrates multiply-accumulation (MAC) logic directly into memory arrays to enhance the performance of modern AI computing. However, the need for customized memory cells and logic components currently necessitates significant manual effort in DCIM design. Existing tools for facilitating DCIM macro designs struggle to optimize subcircuit synthesis to meet user-defined performance criteria, thereby limiting the potential system-level acceleration that DCIM can offer. To address these challenges and enable agile design of DCIM macros with optimal architectures, we present SynDCIM, a performance-aware DCIM compiler that employs multi-spec-oriented subcircuit synthesis. SynDCIM features an automated performance-to-layout generation process that aligns with user-defined performance expectations. This is supported by a scalable subcircuit library and a multi-spec-oriented searching algorithm for effective subcircuit synthesis. The effectiveness of SynDCIM is demonstrated through extensive experiments and validated with a test chip fabricated in a 40nm CMOS process. Testing results reveal that designs generated by SynDCIM exhibit competitive performance when compared to state-of-the-art manually designed DCIM macros.
Problem

Research questions and friction points this paper is trying to address.

DCIM Design
Performance Requirements
AI Acceleration
Innovation

Methods, ideas, or system contributions that make the work stand out.

SynDCIM
DCIM Optimization
Memory-centric Computing
πŸ”Ž Similar Papers
No similar papers found.
K
Kunming Shao
The Hong Kong University of Science and Technology, Hong Kong SAR, China; AI Chip Center for Emerging Smart Systems (ACCESS), Hong Kong SAR, China
Fengshi Tian
Fengshi Tian
Now: HKUST | Former: Fudan University & Westlake University
VLSI and Intelligence
X
Xiaomeng Wang
The Hong Kong University of Science and Technology, Hong Kong SAR, China; AI Chip Center for Emerging Smart Systems (ACCESS), Hong Kong SAR, China
Jiakun Zheng
Jiakun Zheng
The Hong Kong University of Science and Technology
Computing-in-MemoryAI Accelerator
J
Jia Chen
The Hong Kong University of Science and Technology, Hong Kong SAR, China; AI Chip Center for Emerging Smart Systems (ACCESS), Hong Kong SAR, China
Jingyu He
Jingyu He
The Hong Kong University of Science and Technology, Hong Kong SAR, China; AI Chip Center for Emerging Smart Systems (ACCESS), Hong Kong SAR, China
H
Hui Wu
Westlake University, Hangzhou, China
Jinbo Chen
Jinbo Chen
Westlake University, Hangzhou, China
X
Xihao Guan
The Hong Kong University of Science and Technology, Hong Kong SAR, China; AI Chip Center for Emerging Smart Systems (ACCESS), Hong Kong SAR, China
Yi Deng
Yi Deng
The Hong Kong University of Science and Technology, Hong Kong SAR, China; AI Chip Center for Emerging Smart Systems (ACCESS), Hong Kong SAR, China
Fengbin Tu
Fengbin Tu
Assistant Professor at HKUST
AI ChipComputing-in-MemoryComputer ArchitectureReconfigurable Computing
J
Jie Yang
Westlake University, Hangzhou, China
M
Mohamad Sawan
Westlake University, Hangzhou, China
K
Kwang-Ting Cheng
The Hong Kong University of Science and Technology, Hong Kong SAR, China; AI Chip Center for Emerging Smart Systems (ACCESS), Hong Kong SAR, China
C
Chi-Ying Tsui
The Hong Kong University of Science and Technology, Hong Kong SAR, China; AI Chip Center for Emerging Smart Systems (ACCESS), Hong Kong SAR, China