🤖 AI Summary
This work addresses the end-to-end generation of formally verifiable Verilog code from high-level hardware specifications—a longstanding challenge in hardware design automation, where conventional approaches lack formal correctness guarantees. We propose the first formality-aware evaluation benchmark for Verilog code generation; construct the first multi-level, reasoning-driven dataset tailored for hardware code synthesis, along with a lightweight model family; and integrate large language model–based reasoning with formal verification techniques—including combinational equivalence checking, model checking, Verilog syntactic constraint modeling, and domain-specific fine-tuning. Our approach achieves 100% formal verification pass rates across multiple representative hardware modules, while significantly improving functional correctness and synthesis feasibility. The open-sourced dataset and models have been widely adopted by the hardware design and formal methods communities.
📝 Abstract
This paper introduces VeriThoughts, a novel dataset designed for reasoning-based Verilog code generation. We establish a new benchmark framework grounded in formal verification methods to evaluate the quality and correctness of generated hardware descriptions. Additionally, we present a suite of specialized small-scale models optimized specifically for Verilog generation. Our work addresses the growing need for automated hardware design tools that can produce verifiably correct implementations from high-level specifications, potentially accelerating the hardware development process while maintaining rigorous correctness guarantees. Our code and data are available at href{https://github.com/wilyub/VeriThoughts}{this URL}.