RTeAAL Sim: Using Tensor Algebra to Represent and Accelerate RTL Simulation (Extended Version)

📅 2026-01-26

📈 Citations: 0

✨ Influential: 0

career value

210K/year

🤖 AI Summary

RTL simulation on CPUs suffers from long compilation times, high instruction cache pressure, and front-end performance bottlenecks. This work presents the first formulation of RTL simulation as a sparse tensor algebra problem, wherein circuit structures are represented as tensors and simulation is executed using highly optimized tensor algebra kernels. This approach decouples simulation behavior from binary size, enabling direct leveraging of mature tensor optimization techniques to significantly reduce both compilation overhead and instruction cache pressure. A prototype implementation achieves runtime performance comparable to the highly optimized Verilator across diverse CPU architectures and instruction set architectures, demonstrating the effectiveness and generality of the proposed paradigm.

Technology Category

Application Category

📝 Abstract

RTL simulation on CPUs remains a persistent bottleneck in hardware design. State-of-the-art simulators embed the circuit directly into the simulation binary, resulting in long compilation times and execution that is fundamentally CPU frontend-bound, with severe instruction-cache pressure. This work proposes RTeAAL Sim, which reformulates RTL simulation as a sparse tensor algebra problem. By representing RTL circuits as tensors and simulation as a sparse tensor algebra kernel, RTeAAL Sim decouples simulation behavior from binary size and makes RTL simulation amenable to well-studied tensor algebra optimizations. We demonstrate that a prototype of our tensor-based simulator, even with a subset of these optimizations, already mitigates the compilation overhead and frontend pressure and achieves performance competitive with the highly optimized Verilator simulator across multiple CPUs and ISAs.

Problem

Research questions and friction points this paper is trying to address.

RTL simulation

CPU bottleneck

compilation overhead

instruction-cache pressure

frontend-bound execution

Innovation

Methods, ideas, or system contributions that make the work stand out.

tensor algebra

RTL simulation

sparse tensors

hardware simulation

frontend-bound optimization

🔎 Similar Papers

No similar papers found.

💼 Related Jobs

No related jobs found.

Authors to Follow