ForgeHLS: A Large-Scale, Open-Source Dataset for High-Level Synthesis

📅 2025-07-03
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
The EDA community lacks large-scale, open-source, multi-level integrated circuit datasets, severely hindering algorithm evaluation and AI model training for critical tasks such as high-level synthesis (HLS). To address this, we introduce CircuitBench—the first open-source, large-scale circuit dataset—encompassing diverse designs including digital circuits, arithmetic units, and memory blocks. It provides four complementary representations: RTL code, post-mapping netlists, And-Inverter Graphs (AIGs), and post-placement netlists. Designed for diversity, extensibility, and task alignment, CircuitBench supports benchmarking and end-to-end deep learning for core EDA tasks like PPA (power, performance, area) optimization. Experimental results demonstrate significant improvements in AI model accuracy and robustness across cross-circuit generalization, few-shot adaptation, and optimization performance prediction. CircuitBench thus establishes a foundational data resource for AI-driven EDA research.

Technology Category

Application Category

📝 Abstract
We introduce ForgeEDA, an open-source comprehensive circuit dataset across various categories. ForgeEDA includes diverse circuit representations such as Register Transfer Level (RTL) code, Post-mapping (PM) netlists, And-Inverter Graphs (AIGs), and placed netlists, enabling comprehensive analysis and development. We demonstrate ForgeEDA's utility by benchmarking state-of-the-art EDA algorithms on critical tasks such as Power, Performance, and Area (PPA) optimization, highlighting its ability to expose performance gaps and drive advancements. Additionally, ForgeEDA's scale and diversity facilitate the training of AI models for EDA tasks, demonstrating its potential to improve model performance and generalization. By addressing limitations in existing datasets, ForgeEDA aims to catalyze breakthroughs in modern IC design and support the next generation of innovations in EDA.
Problem

Research questions and friction points this paper is trying to address.

Providing a large-scale open-source dataset for High-Level Synthesis (HLS)
Enabling comprehensive analysis of circuit representations like RTL and AIGs)
Facilitating AI model training for EDA tasks and PPA optimization)
Innovation

Methods, ideas, or system contributions that make the work stand out.

Open-source dataset with diverse circuit representations
Benchmarks EDA algorithms for PPA optimization
Facilitates AI model training for EDA tasks
🔎 Similar Papers
No similar papers found.
Zedong Peng
Zedong Peng
MIT
Operations ResearchOptimizationMixed-Integer ProgrammingProcess System Engineering
Z
Zeju Li
The Chinese University of Hong Kong , Hong Kong S.A.R.
M
Mingzhe Gao
Shanghai Jiao Tong University , Shanghai, China
Q
Qiang Xu
The Chinese University of Hong Kong , Hong Kong S.A.R.
C
Chen Zhang
Shanghai Jiao Tong University , Shanghai, China
Jieru Zhao
Jieru Zhao
Associate Professor, Shanghai Jiao Tong University
Hardware-software co-designAI acceleration and systemCompilerFPGAHigh-level synthesis