From GPUs to RRAMs: Distributed In-Memory Primal-Dual Hybrid Gradient Method for Solving Large-Scale Linear Optimization Problem

📅 2025-09-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Conventional architectures struggle to support large-scale linear optimization, and existing algorithms are poorly suited for resistive random-access memory (RRAM)-based in-memory computing (IMC). Method: This paper proposes an algorithm–hardware co-designed distributed IMC framework. It is the first to implement the primal-dual hybrid gradient (PDHG) algorithm on RRAM, employing a symmetric block-matrix structure to unify distributed crossbar operations. This design significantly reduces write overhead from frequent reprogramming and enhances robustness against device non-idealities. Leveraging the MELISO+ physical-level simulation framework, the approach integrates analog-domain matrix computation with robust optimization techniques. Results: On large-scale linear programming tasks, the framework achieves solution accuracy comparable to GPU-based solvers, while delivering up to three orders-of-magnitude reduction in energy consumption and latency. These results demonstrate both the feasibility and superiority of IMC for solving large-scale optimization problems.

Technology Category

Application Category

📝 Abstract
The exponential growth of computational workloads is surpassing the capabilities of conventional architectures, which are constrained by fundamental limits. In-memory computing (IMC) with RRAM provides a promising alternative by providing analog computations with significant gains in latency and energy use. However, existing algorithms developed for conventional architectures do not translate to IMC, particularly for constrained optimization problems where frequent matrix reprogramming remains cost-prohibitive for IMC applications. Here we present a distributed in-memory primal-dual hybrid gradient (PDHG) method, specifically co-designed for arrays of RRAM devices. Our approach minimizes costly write cycles, incorporates robustness against device non-idealities, and leverages a symmetric block-matrix formulation to unify operations across distributed crossbars. We integrate a physics-based simulation framework called MELISO+ to evaluate performance under realistic device conditions. Benchmarking against GPU-accelerated solvers on large-scale linear programs demonstrates that our RRAM-based solver achieves comparable accuracy with up to three orders of magnitude reductions in energy consumption and latency. These results demonstrate the first PDHG-based LP solver implemented on RRAMs, showcasing the transformative potential of algorithm-hardware co-design for solving large-scale optimization through distributed in-memory computing.
Problem

Research questions and friction points this paper is trying to address.

Solving large-scale linear optimization problems efficiently
Overcoming computational limitations of conventional architectures
Adapting algorithms for in-memory computing with RRAM
Innovation

Methods, ideas, or system contributions that make the work stand out.

Distributed in-memory PDHG method for RRAM arrays
Minimizes write cycles and handles device non-idealities
Uses symmetric block-matrix for distributed crossbar operations
🔎 Similar Papers
No similar papers found.
H
Huynh Q. N. Vo
School of Industrial Engineering and Management, Oklahoma State University, Stillwater, OK, USA
M
Md Tawsif Rahman Chowdhury
Energy Systems and Infrastructure Assessment Division, Argonne National Laboratory, Lemont, IL, USA
Paritosh Ramanan
Paritosh Ramanan
Oklahoma State University
Decentralized OptimizationDistributed ComputingBlockchainFederated LearningDifferential Privacy
G
Gozde Tutuncuoglu
Department of Electrical and Computer Engineering, Wayne State University, Detroit, MI, USA
Junchi Yang
Junchi Yang
Chinese University of Hong Kong, Shenzhen
OptimizationMachine Learning
Feng Qiu
Feng Qiu
Argonne National Laboratory
Mathematical programmingoptimizationpower systemsenergy systems
Murat Yildirim
Murat Yildirim
Wayne State University
Power SystemsPredictive AnalyticsCondition Based Maintenance