🤖 AI Summary
This work addresses the limited scalability and performance of memristor-based reservoir computing in long-sequence classification by introducing MARS, a novel architecture that integrates a memristor-friendly parallel reservoir with innovative subtraction-based skip connections. Operating within a gradient-free training framework that eliminates backpropagation, MARS substantially enhances model depth and computational efficiency while leveraging in-memory computing advantages. The proposed method outperforms state-of-the-art models—including LRU, S5, and Mamba—on multiple long-sequence benchmarks, reducing training time from minutes to hundreds of milliseconds and achieving speedups of up to 21×, thereby effectively overcoming the performance bottlenecks inherent in conventional reservoir computing approaches.
📝 Abstract
Memristive devices present a promising foundation for next-generation information processing by combining memory and computation within a single physical substrate. This unique characteristic enables efficient, fast, and adaptive computing, particularly well suited for deep learning applications. Among recent developments, the memristive-friendly echo state network (MF-ESN) has emerged as a promising approach that combines memristive-inspired dynamics with the training simplicity of reservoir computing, where only the readout layer is learned. Building on this framework, we propose memristive-friendly parallelized reservoirs (MARS), a simplified yet more effective architecture that enables efficient scalable parallel computation and deeper model composition through novel subtractive skip connections. This design yields two key advantages: substantial training speedups of up to 21x over the inherently lightweight echo state network baseline and significantly improved predictive performance. Moreover, MARS demonstrates what is possible with parallel memristive-friendly reservoir computing: on several long sequence benchmarks our compact gradient-free models substantially outperform strong gradient-based sequence models such as LRU, S5, and Mamba, while reducing full training time from minutes or hours down seconds or even only a few hundred milliseconds. Our work positions parallel memristive-friendly computing as a promising route towards scalable neuromorphic learning systems that combine high predictive capability with radically improved computational efficiency, while providing a clear pathway to energy-efficient, low-latency implementations on emerging memristive and in-memory hardware.