HAVEN: Hybrid Automated Verification ENgine for UVM Testbench Synthesis with LLMs

📅 2026-04-30

📈 Citations: 0

✨ Influential: 0

career value

173K/year

🤖 AI Summary

This work addresses the challenge that large language models (LLMs) struggle to reliably generate Universal Verification Methodology (UVM) testbenches and test sequences due to insufficient training data in hardware description languages, thereby limiting IC verification efficiency. To overcome this, the authors propose HAVEN, a novel system that integrates LLM-based agents, a protocol-aware domain-specific language (DSL), and predefined Jinja2 templates to circumvent direct HDL code generation by LLMs. In HAVEN, an LLM agent parses design specifications to produce a structured architecture; Jinja2 templates instantiate UVM components; and the DSL decomposes and iteratively refines test sequences, guided by a coverage-gap feedback mechanism to progressively enhance verification coverage. Evaluated on 19 open-source IPs, HAVEN achieves 100% compilation success, 90.6% average code coverage, and 87.9% functional coverage, establishing a new state of the art in LLM-assisted verification.

📝 Abstract

Integrated Circuit (IC) verification consumes nearly 70% of the IC development cycle, and recent research leverages Large Language Models (LLMs) to automatically generate testbenches and reduce verification overhead. However, LLMs have difficulty generating testbenches correctly. Unlike high-level programming languages, Hardware Description Languages (HDLs) are extremely rare in LLMs training data, leading LLMs to produce incorrect code. To overcome challenges when using LLMs to generate Universal Verification Methodology (UVM) testbenches and sequences, wepropose HAVEN (Hybrid Automated Verification ENgine) to prevent LLMs from writing HDL directly. For UVM testbench generation, HAVEN utilizes LLM agents to analyze design specifications to produce a structured architectural plan. The HAVEN Template Engine then combines with predefined and protocol-specific templates to generate all UVM components with correct bus-handshake timing. For UVM sequence generation, HAVEN introduces a Protocol-Aware Sequence Domain-Specific Language (DSL) that decomposes sequences into fine-grained step types. A set of predefined DSL patterns first establishes sequences that achieve a high coverage rate without LLM involvement. HAVEN continues to improve the coverage rate by iteratively leveraging LLM agents to analyze coverage gap reports and compose additional targeted DSL sequences. Unlike previous works, HAVEN is the first system that utilizes pre-defined, protocol-specific Jinja2 templates to generate all UVM components and UVM sequences using our proposed Protocol-Aware DSL and rule-based code generator. Our experimental results on 19 open-source IP designs spanning three interface protocols (Direct, Wishbone, AXI4-Lite) show that HAVEN achieves 100% compilation success, 90.6% code coverage, and 87.9% functional coverage on average, and is SOTA among LLM-assisted testbench generation systems.

Problem

Research questions and friction points this paper is trying to address.

UVM testbench synthesis

Large Language Models

Hardware Description Languages

verification automation

code correctness

Innovation

Methods, ideas, or system contributions that make the work stand out.

UVM testbench synthesis

Large Language Models (LLMs)

Protocol-Aware DSL