RIROS: A Parallel RTL Fault SImulation FRamework with TwO-Dimensional Parallelism and Unified Schedule

📅 2025-08-22
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Parallel efficiency in RTL-level fault simulation for safety-critical chips (e.g., autonomous driving, embodied AI) is severely hindered by dynamic fault propagation paths and highly imbalanced task loads. Method: This paper proposes a two-dimensional parallel architecture integrating structural-level parallelism (with work-stealing) and fault-level parallelism, coupled with a unified compute/global-synchronization scheduling mechanism to eliminate simulation bubbles. The approach fundamentally restructures the conventional serial execution and synchronous paradigm, enabling both high throughput for lightweight tasks and fine-grained decomposition for heavy-load tasks. Contribution/Results: Experimental evaluation demonstrates speedups of 7.0× over the state-of-the-art academic RTL fault simulator and 11.0× over leading commercial tools, significantly reducing verification turnaround time and enabling scalable functional safety compliance verification.

Technology Category

Application Category

📝 Abstract
With the rapid development of safety-critical applications such as autonomous driving and embodied intelligence, the functional safety of the corresponding electronic chips becomes more critical. Ensuring chip functional safety requires performing a large number of time-consuming RTL fault simulations during the design phase, significantly increasing the verification cycle. To meet time-to-market demands while ensuring thorough chip verification, parallel acceleration of RTL fault simulation is necessary. Due to the dynamic nature of fault propagation paths and varying fault propagation capabilities, task loads in RTL fault simulation are highly imbalanced, making traditional singledimension parallel methods, such as structural-level parallelism, ineffective. Through an analysis of fault propagation paths and task loads, we identify two types of tasks in RTL fault simulation: tasks that are few in number but high in load, and tasks that are numerous but low in load. Based on this insight, we propose a two-dimensional parallel approach that combines structurallevel and fault-level parallelism to minimize bubbles in RTL fault simulation. Structural-level parallelism combining with workstealing mechanism is used to handle the numerous low-load tasks, while fault-level parallelism is applied to split the high-load tasks. Besides, we deviate from the traditional serial execution model of computation and global synchronization in RTL simulation by proposing a unified computation/global synchronization scheduling approach, which further eliminates bubbles. Finally, we implemented a parallel RTL fault simulation framework, RIROS. Experimental results show a performance improvement of 7.0 times and 11.0 times compared to the state-of-the-art RTL fault simulation and a commercial tool.
Problem

Research questions and friction points this paper is trying to address.

Accelerating RTL fault simulation for chip functional safety
Addressing task load imbalance in parallel fault simulation
Reducing verification cycle time with two-dimensional parallelism
Innovation

Methods, ideas, or system contributions that make the work stand out.

Two-dimensional parallelism combining structural and fault levels
Work-stealing mechanism for load balancing
Unified computation and global synchronization scheduling
🔎 Similar Papers
No similar papers found.
Jiaping Tang
Jiaping Tang
State Key Lab of Processors, Institute of Computing Technology, Chinese Academy of Sciences
RTL SimulationAutomatic RTL DebugSoftware/Hardware co-acceleration
Jianan Mu
Jianan Mu
Institute of Computing Technology, State Key Laboratory of Processors (SKLP), CAS
Design AutomationAccelaretorPrivacy Preserving Computing
Z
Zizhen Liu
State Key Lab of Processors, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
G
Ge Yu
State Key Lab of Processors, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China; CASTEST Co., Ltd., Beijing, China
T
Tenghui Hua
State Key Lab of Processors, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China; CASTEST Co., Ltd., Beijing, China
B
Bin Sun
State Key Lab of Processors, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China; CASTEST Co., Ltd., Beijing, China
S
Silin Liu
State Key Lab of Processors, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
J
Jing Ye
State Key Lab of Processors, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China; CASTEST Co., Ltd., Beijing, China
Huawei Li
Huawei Li
Institute of Computing Technology, Chinese Academy of Sciences
computer engineering