MIREDO: MIP-Driven Resource-Efficient Dataflow Optimization for Computing-in-Memory Accelerator

📅 2025-10-30

📈 Citations: 0

✨ Influential: 0

career value

262K/year

🤖 AI Summary

Dataflow optimization for Compute-in-Memory (CIM) architectures faces a dual challenge: an exponentially large design space coupled with stringent hardware constraints. Method: This paper proposes a systematic co-optimization framework based on Mixed-Integer Programming (MIP). It integrates hierarchical hardware abstraction and cycle-accurate latency modeling to jointly optimize DNN workload mapping, dataflow scheduling, and CIM unit-level constraints—overcoming the local optima limitation of conventional heuristic approaches. Contribution/Results: Its key innovation lies in explicitly encoding non-differentiable architectural constraints as MIP formulations and enabling cross-layer trade-offs among resources, bandwidth, and latency. Evaluated across diverse mainstream DNN models and CIM hardware configurations, the framework achieves an average 2.7× speedup (up to 3.2×), significantly narrowing the gap between theoretical peak efficiency and realized system efficiency. It establishes a verifiable, scalable, and automated optimization paradigm for practical CIM accelerator deployment.

Technology Category

Application Category

📝 Abstract

Computing-in-Memory (CIM) architectures have emerged as a promising solution for accelerating Deep Neural Networks (DNNs) by mitigating data movement bottlenecks. However, realizing the potential of CIM requires specialized dataflow optimizations, which are challenged by an expansive design space and strict architectural constraints. Existing optimization approaches often fail to fully exploit CIM accelerators, leading to noticeable gaps between theoretical and actual system-level efficiency. To address these limitations, we propose the MIREDO framework, which formulates dataflow optimization as a Mixed-Integer Programming (MIP) problem. MIREDO introduces a hierarchical hardware abstraction coupled with an analytical latency model designed to accurately reflect the complex data transfer behaviors within CIM systems. By jointly modeling workload characteristics, dataflow strategies, and CIM-specific constraints, MIREDO systematically navigates the vast design space to determine the optimal dataflow configurations. Evaluation results demonstrate that MIREDO significantly enhances performance, achieving up to $3.2 imes$ improvement across various DNN models and hardware setups.

Problem

Research questions and friction points this paper is trying to address.

Optimizing dataflow for computing-in-memory DNN accelerators

Addressing design space complexity and architectural constraints

Bridging theoretical and actual system-level efficiency gaps

Innovation

Methods, ideas, or system contributions that make the work stand out.

MIP formulation for dataflow optimization problem

Hierarchical abstraction with analytical latency model

Joint modeling of workloads and CIM constraints

🔎 Similar Papers

No similar papers found.