Harnessing the Full Potential of RRAMs through Scalable and Distributed In-Memory Computing with Integrated Error Correction

📅 2025-08-18
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Conventional architectures face energy-efficiency bottlenecks due to data movement, while RRAM-based in-memory computing (IMC) suffers from device non-idealities and scalability limitations. To address these challenges, this paper proposes MELISO+, a full-stack distributed IMC framework. Leveraging algorithm-hardware co-design, it introduces a two-level error correction mechanism that suppresses first- and second-order arithmetic errors by over 90%. Its scalable distributed architecture supports matrix operations up to 65,000 × 65,000 dimensions. MELISO+ is tailored for high-dimensional workloads—including large language models and generative AI—achieving superior accuracy, energy efficiency, and latency on low-precision RRAM devices. Compared to conventional von Neumann architectures, it delivers a 3–5×10³ improvement in energy efficiency and a 100× reduction in latency.

Technology Category

Application Category

📝 Abstract
Exponential growth in global computing demand is exacerbated due to the higher-energy requirements of conventional architectures, primarily due to energy-intensive data movement. In-memory computing with Resistive Random Access Memory (RRAM) addresses this by co-integrating memory and processing, but faces significant hurdles related to device-level non-idealities and poor scalability for large computing tasks. Here, we introduce extbf{MELISO+} (In- extbf{Me}mory extbf{Li}near extbf{So}lver), a full-stack, distributed framework for energy-efficient in-memory computing. MELISO+ proposes a novel two-tier error correction mechanism to mitigate device non-idealities and develops a distributed RRAM computing framework to enable matrix computations exceeding dimensions of $65,000 imes 65,000$. This approach reduces first- and second-order arithmetic errors due to device non-idealities by over 90%, enhances energy efficiency by three to five orders of magnitude, and decreases latency 100-fold. Hence, MELISO+ allows lower-precision RRAM devices to outperform high-precision device alternatives in accuracy, energy and latency metrics. By unifying algorithm-hardware co-design with scalable architecture, MELISO+ significantly advances sustainable, high-dimensional computing suitable for applications like large language models and generative AI.
Problem

Research questions and friction points this paper is trying to address.

Addressing energy-intensive data movement in conventional computing architectures
Mitigating device-level non-idealities in resistive random access memory computing
Enabling scalable distributed in-memory computing for large matrix operations
Innovation

Methods, ideas, or system contributions that make the work stand out.

Two-tier error correction mechanism
Distributed RRAM computing framework
Algorithm-hardware co-design unification
🔎 Similar Papers
No similar papers found.
H
Huynh Q. N. Vo
School of Industrial Engineering and Management, Oklahoma State University, Stillwater, 74078, OK, USA.
M
Md Tawsif Rahman Chowdhury
Department of Electrical and Computer Engineering, Wayne State University, Detroit, 48202, MI, USA.
Paritosh Ramanan
Paritosh Ramanan
Oklahoma State University
Decentralized OptimizationDistributed ComputingBlockchainFederated LearningDifferential Privacy
Murat Yildirim
Murat Yildirim
Wayne State University
Power SystemsPredictive AnalyticsCondition Based Maintenance
G
Gozde Tutuncuoglu
Department of Electrical and Computer Engineering, Wayne State University, Detroit, 48202, MI, USA.