Accelerating materials discovery using foundation model based In-context active learning

📅 2026-03-12

🤖 AI Summary
This work proposes In-Context Active Learning (ICAL), a novel approach that leverages the pre-trained tabular foundation model TabPFN for materials discovery. Traditional active learning methods are hindered by the rigid kernel assumptions of Gaussian processes or the unreliable uncertainty estimates of random forests under small-sample conditions. ICAL circumvents these limitations by exploiting TabPFN's Transformer architecture to perform Bayesian inference without fine-tuning, achieving well-calibrated uncertainty estimates in a single forward pass. Evaluated across ten materials datasets, ICAL outperforms conventional methods on eight, reducing the required number of experiments by 52% and 29.77% on average compared to Gaussian processes and random forests, respectively, while demonstrating significantly superior uncertainty calibration.
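
One common way to score the uncertainty calibration mentioned above is the Gaussian negative log-likelihood (NLL), which the abstract lists among its reported metrics. The sketch below is an illustration under assumptions, not the paper's evaluation code: it assumes each prediction is summarized by a mean and a standard deviation, and the function name `gaussian_nll` is hypothetical.

```python
import math


def gaussian_nll(y_true, means, stds):
    """Mean Gaussian negative log-likelihood; lower is better.

    Assumes every prediction is a normal distribution N(mean, std^2).
    Overconfident models (std too small for their errors) are penalized
    heavily by the squared-error term.
    """
    total = 0.0
    for y, m, s in zip(y_true, means, stds):
        var = s * s
        total += 0.5 * math.log(2 * math.pi * var) + (y - m) ** 2 / (2 * var)
    return total / len(y_true)


# Same errors, different claimed uncertainty (illustrative numbers only).
targets = [1.0, -1.0]
overconfident = gaussian_nll(targets, [0.0, 0.0], [0.1, 0.1])
calibrated = gaussian_nll(targets, [0.0, 0.0], [1.0, 1.0])
```

With identical prediction errors, the surrogate that claims a standard deviation matching its actual error receives the lower NLL, which is the sense in which a lower NLL indicates better calibration.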

📝 Abstract
Active learning (AL) has emerged as a powerful paradigm for accelerating materials discovery by iteratively steering experiments toward the most promising candidates, reducing costly synthesis-and-characterization cycles. However, current AL relies predominantly on Gaussian Process (GP) and Random Forest (RF) surrogates with complementary limitations: GP underfits complex composition-property landscapes due to rigid kernel assumptions, while RF produces unreliable uncertainty estimates in small-data regimes, precisely where most materials datasets reside (fewer than 500 samples). Here we propose foundation-model-based In-Context Active Learning (ICAL), replacing conventional surrogates with TabPFN, a transformer-based foundation model pre-trained on millions of synthetic tasks to meta-learn a universal prior over tabular data. TabPFN performs principled Bayesian inference in a single forward pass without dataset-specific retraining, delivering well-calibrated predictive uncertainty where GP and RF fail most severely. Benchmarked against GP and RF across 10 materials datasets spanning copper alloy hardness and electrical conductivity, bulk metallic glass-forming ability, and crystal lattice thermal conductivity, TabPFN wins on 8 out of 10 datasets, achieving a mean saving of 52% in extra experiments/evaluations relative to GP and 29.77% relative to RF. Cross-validation analysis confirms that TabPFN's advantage stems from superior uncertainty calibration, achieving the lowest Negative Log-Likelihood and Area Under the Sparsification Error curve among all surrogates. Our work demonstrates that a pre-trained foundation model can serve as a highly effective surrogate for accelerating active learning-based materials discovery.
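
The iterative loop described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation. The `knn_surrogate` function is a hypothetical stand-in for TabPFN (any model that returns a predictive mean and standard deviation could be swapped in), expected improvement is an assumed acquisition function because the abstract does not name one, and all function and parameter names are invented for this example.

```python
import math
import random
from statistics import NormalDist


def expected_improvement(mean, std, best):
    """Expected improvement over `best` for a maximization problem."""
    if std < 1e-12:
        # No predictive uncertainty: only the deterministic gain counts.
        return max(mean - best, 0.0)
    z = (mean - best) / std
    normal = NormalDist()
    return (mean - best) * normal.cdf(z) + std * normal.pdf(z)


def knn_surrogate(train_X, train_y, query_X, k=3):
    """Hypothetical stand-in surrogate (NOT TabPFN).

    Returns a (mean, std) pair per query point: the mean of the k nearest
    labeled targets, and their spread plus the distance to the nearest
    labeled point, so unexplored regions look more uncertain.
    """
    preds = []
    for q in query_X:
        nearest = sorted(zip(train_X, train_y),
                         key=lambda pair: math.dist(pair[0], q))[:k]
        ys = [y for _, y in nearest]
        mean = sum(ys) / len(ys)
        spread = math.sqrt(sum((y - mean) ** 2 for y in ys) / len(ys))
        preds.append((mean, spread + math.dist(nearest[0][0], q)))
    return preds


def active_learning(pool_X, oracle, surrogate, n_init=5, budget=20, seed=0):
    """Pool-based active learning: label the candidate with the highest EI.

    `oracle` plays the role of an experiment: it returns the measured
    property for one candidate. Returns the labeled indices in acquisition
    order and a dict mapping index -> measured value.
    """
    rng = random.Random(seed)
    unlabeled = list(range(len(pool_X)))
    rng.shuffle(unlabeled)
    labeled = [unlabeled.pop() for _ in range(n_init)]  # random initial set
    y = {i: oracle(pool_X[i]) for i in labeled}

    for _ in range(budget):
        if not unlabeled:
            break
        best = max(y.values())
        preds = surrogate([pool_X[i] for i in labeled],
                          [y[i] for i in labeled],
                          [pool_X[i] for i in unlabeled])
        scores = [expected_improvement(m, s, best) for m, s in preds]
        pick = unlabeled.pop(max(range(len(scores)), key=scores.__getitem__))
        y[pick] = oracle(pool_X[pick])  # run the "experiment"
        labeled.append(pick)
    return labeled, y
```

For example, with a one-dimensional candidate pool `[(i / 49,) for i in range(50)]` and a toy oracle `lambda x: -(x[0] - 0.7) ** 2`, calling `active_learning(pool, oracle, knn_surrogate)` labels 25 distinct candidates (5 initial plus 20 acquired). Keeping the surrogate behind a plain function boundary is the design choice that makes a GP, RF, or TabPFN comparison a one-argument change.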
Problem

Research questions and friction points this paper is trying to address.

active learning
materials discovery
surrogate models
uncertainty estimation
small-data regimes
Innovation

Methods, ideas, or system contributions that make the work stand out.

foundation model
active learning
TabPFN
materials discovery
uncertainty calibration