ForgeEDA: A Comprehensive Multimodal Dataset for Advancing EDA

📅 2025-05-04
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing EDA datasets suffer from limited scale, single-modality representation, and misalignment across heterogeneous design abstractions, hindering the development of AI-driven circuit design methodologies. To address these limitations, ForgeEDA introduces the first open-source, end-to-end multimodal IC design dataset, unifying diverse representations—including RTL code, mapped netlists, AIG graphs, and placement netlists—under a standardized schema. It is constructed via a robust end-to-end toolchain encompassing HDL parsing, logic synthesis, formal graph modeling, and physical design. This dataset fills a critical gap in large-scale, highly aligned, multi-granularity circuit data. Empirical evaluation reveals significant performance bottlenecks of mainstream EDA algorithms in PPA (Power, Performance, Area) optimization. Models trained on ForgeEDA demonstrate improved prediction accuracy and enhanced cross-task transferability, thereby advancing the AI-EDA methodology.

Technology Category

Application Category

📝 Abstract
We introduce ForgeEDA, an open-source comprehensive circuit dataset across various categories. ForgeEDA includes diverse circuit representations such as Register Transfer Level (RTL) code, Post-mapping (PM) netlists, And-Inverter Graphs (AIGs), and placed netlists, enabling comprehensive analysis and development. We demonstrate ForgeEDA's utility by benchmarking state-of-the-art EDA algorithms on critical tasks such as Power, Performance, and Area (PPA) optimization, highlighting its ability to expose performance gaps and drive advancements. Additionally, ForgeEDA's scale and diversity facilitate the training of AI models for EDA tasks, demonstrating its potential to improve model performance and generalization. By addressing limitations in existing datasets, ForgeEDA aims to catalyze breakthroughs in modern IC design and support the next generation of innovations in EDA.
Problem

Research questions and friction points this paper is trying to address.

Provides multimodal circuit dataset for EDA analysis
Benchmarks EDA algorithms for PPA optimization
Enables AI model training for EDA tasks
Innovation

Methods, ideas, or system contributions that make the work stand out.

Open-source multimodal circuit dataset ForgeEDA
Includes RTL, PM netlists, AIGs, placed netlists
Facilitates AI training for EDA tasks
🔎 Similar Papers
No similar papers found.
Z
Zhengyuan Shi
Department of Computer Science and Engineering, The Chinese University of Hong Kong, Sha Tin, Hong Kong S.A.R.
Z
Zeju Li
Department of Computer Science and Engineering, The Chinese University of Hong Kong, Sha Tin, Hong Kong S.A.R.
C
Chengyu Ma
Faculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, China
Yunhao Zhou
Yunhao Zhou
Shanghai Jiao Tong University
EDAGNNLLM
Ziyang Zheng
Ziyang Zheng
Shanghai Jiao Tong University
Signal ProcessingInverse ProblemPhotonic Computing
J
Jiawei Liu
School of Computer Science, Beijing University of Posts and Telecommunications, Beijing, China
Hongyang Pan
Hongyang Pan
Fudan university
Logic Synthesis
Lingfeng Zhou
Lingfeng Zhou
Shanghai Jiao Tong University
K
Kezhi Li
Department of Computer Science and Engineering, The Chinese University of Hong Kong, Sha Tin, Hong Kong S.A.R.
J
Jiaying Zhu
Department of Computer Science and Engineering, The Chinese University of Hong Kong, Sha Tin, Hong Kong S.A.R.
L
Lingwei Yan
College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, China
Z
Zhiqiang He
College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, China
Chenhao Xue
Chenhao Xue
School of Integrated Circuits, Peking University
AIcomputer architectureEDA
W
Wentao Jiang
Faculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, China
F
Fan Yang
School of Microelectronics, State Key Laboratory of Integrated Chips and System, Fudan University, Shanghai, China
Guangyu Sun
Guangyu Sun
School of Integrated Circuits, Peking University
Computer ArchitectureDesign AutomationEmerging Memory
Xiaoyan Yang
Xiaoyan Yang
Advanced Digital Sciences Center
databasedeep learningtext mining
G
Gang Chen
College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, China
Chuan Shi
Chuan Shi
Beijing University of Posts and Telecommunications
data miningmachine learningsocial network analysis
Z
Z. Chu
J
Jun Yang
School of Intergrated Circuits, Southeast University, Nanjing, China
Q
Qiang Xu
Department of Computer Science and Engineering, The Chinese University of Hong Kong, Sha Tin, Hong Kong S.A.R.