Therm-FM: Foundation Model is ALL YOU NEED for 3D-ICs Thermal Simulation

📅 2026-05-21
📈 Citations: 0
Influential: 0
📄 PDF

career value

218K/year
🤖 AI Summary
This work addresses the limited reusability of traditional data-driven thermal simulation methods for 3D integrated circuits (3D-ICs), which rely heavily on large volumes of high-fidelity finite element data. To overcome this challenge, the authors propose Therm-FM, the first approach to transfer a pre-trained partial differential equation (PDE) foundation model to 3D-IC thermal simulation. By integrating neural operators with multi-fidelity learning and introducing a thermal equivalent modeling mechanism, Therm-FM enables highly accurate and efficient prediction of both steady-state and transient thermal fields using only minimal target-domain data. Evaluated on public and industrial-scale benchmarks, Therm-FM surpasses state-of-the-art methods with less than 20% of the training data, reducing average errors by up to 10.6×. Moreover, cross-chip adaptation requires merely 10–30 samples to match or exceed the performance of full-data baselines.
📝 Abstract
Data-driven thermal predictors for 3D-ICs are often trained from scratch for each chip design using many high-fidelity finite-element simulations, leading to high data-generation cost and costly cross-design reuse. We propose Therm-FM, a neural operator framework that adapts a pretrained partial differential equation (PDE) foundation model to steady-state and transient 3D-IC thermal simulation. The motivation is that steady-state and transient chip-level heat conduction respectively share elliptic and parabolic operator structures with diffusion-type PDEs, allowing pretrained diffusion priors to provide an effective initialization for thermal-field prediction under heterogeneous materials, dense TSV/microbump interconnects, and package-level boundary conditions. To further reduce data-generation cost, Therm-FM incorporates a thermal-equivalent multi-fidelity training strategy that uses low-cost approximate simulations for thermal-domain adaptation and limited high-fidelity samples for calibration. Experiments on public HotSpot benchmarks and industrial 3D-IC package benchmarks show that Therm-FM achieves up to a 10.6x reduction in mean error and surpasses prior best accuracy with less than 20% of the training data. In cross-chip adaptation, it matches or surpasses full-data baselines in several metrics using only 10--30 target samples. We release datasets, source code, and pretrained models at https://github.com/haiyangxin/Therm-FM.
Problem

Research questions and friction points this paper is trying to address.

3D-ICs
thermal simulation
data-driven modeling
high-fidelity simulation
cross-design reuse
Innovation

Methods, ideas, or system contributions that make the work stand out.

foundation model
neural operator
3D-IC thermal simulation
multi-fidelity learning
PDE adaptation
🔎 Similar Papers
No similar papers found.
Zhen Huang
Zhen Huang
National University of Defense Technology
distributed storageNLPmachine learning
H
Haiyang Xin
School of Computer Science and Technology, University of Science and Technology of China, Hefei, China
Wenkai Yang
Wenkai Yang
Renmin University of China
Natural Language ProcessingMachine Learning
Y
Yangbo Wei
Eastern Institute of Technology, Ningbo, China
Zhiping Yu
Zhiping Yu
Beihang University
deep learningremote sensingAIGC
Yu Zhang
Yu Zhang
University of Science and Technology of China
Efficient AI SystemsProgramming SystemsProgram AnalysisMulti-Modal Perception
Wei W. Xing
Wei W. Xing
The University of Sheffield
Bayesian optimizationElectronic design automation (EDA)AI4EDAmachine learning
T
Ting-Jung Lin
Ningbo Institute of Digital Twin, Eastern Institute of Technology, Ningbo, China; Engineering Research Center of Chiplet Design and Manufacturing of Zhejiang Province, Ningbo, China
L
Lei He
Eastern Institute of Technology, Ningbo, China