Towards Fully-Automated Materials Discovery via Large-Scale Synthesis Dataset and Expert-Level LLM-as-a-Judge

📅 2025-02-23
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Material synthesis has long relied on empirical trial-and-error, hindering innovation in energy, catalysis, and biomedicine. To address this, we introduce AlchemyBench—the first LLM-specific benchmark for end-to-end materials synthesis—comprising 17K expert-validated synthesis protocols. It supports three core tasks: precursor/equipment prediction, synthetic procedure generation, and characterization outcome forecasting. We propose LLM-as-a-Judge, an automated evaluation framework achieving a Pearson correlation of 0.89 with human expert scores and statistical agreement comparable to domain experts. Our approach pioneers end-to-end LLM-based synthesis intelligence via large-scale synthesis data, multi-stage task modeling, and prompt engineering. AlchemyBench fills a critical gap by providing the first high-quality, expert-curated benchmark for materials synthesis, significantly enhancing both the efficiency and reliability of synthesis protocol generation and assessment.

Technology Category

Application Category

📝 Abstract
Materials synthesis is vital for innovations such as energy storage, catalysis, electronics, and biomedical devices. Yet, the process relies heavily on empirical, trial-and-error methods guided by expert intuition. Our work aims to support the materials science community by providing a practical, data-driven resource. We have curated a comprehensive dataset of 17K expert-verified synthesis recipes from open-access literature, which forms the basis of our newly developed benchmark, AlchemyBench. AlchemyBench offers an end-to-end framework that supports research in large language models applied to synthesis prediction. It encompasses key tasks, including raw materials and equipment prediction, synthesis procedure generation, and characterization outcome forecasting. We propose an LLM-as-a-Judge framework that leverages large language models for automated evaluation, demonstrating strong statistical agreement with expert assessments. Overall, our contributions offer a supportive foundation for exploring the capabilities of LLMs in predicting and guiding materials synthesis, ultimately paving the way for more efficient experimental design and accelerated innovation in materials science.
Problem

Research questions and friction points this paper is trying to address.

Automating materials discovery process
Data-driven synthesis prediction framework
LLM-based expert-level evaluation system
Innovation

Methods, ideas, or system contributions that make the work stand out.

Large-scale synthesis dataset curation
LLM-as-a-Judge framework
End-to-end synthesis prediction framework
🔎 Similar Papers
No similar papers found.
H
Heegyu Kim
Department of Artificial Intelligence, Ajou University, Suwon 16499, Republic of Korea
T
Taeyang Jeon
Department of Artificial Intelligence, Ajou University, Suwon 16499, Republic of Korea
S
Seungtaek Choi
Department of Materials Science and Engineering and Department of Energy Systems Research, Ajou University, Suwon 16499, Republic of Korea
Jihoon Hong
Jihoon Hong
Department of Materials Science and Engineering and Department of Energy Systems Research, Ajou University, Suwon 16499, Republic of Korea
D
Dongwon Jeon
Department of Materials Science and Engineering and Department of Energy Systems Research, Ajou University, Suwon 16499, Republic of Korea
S
Sungbum Cho
Department of Materials Science and Engineering and Department of Energy Systems Research, Ajou University, Suwon 16499, Republic of Korea
G
Ga-Yeon Baek
Division of Materials Science and Engineering, Hanyang University, Seoul 04763, Republic of Korea
K
Kyung-Won Kwak
Division of Materials Science and Engineering, Hanyang University, Seoul 04763, Republic of Korea
D
Dong-Hee Lee
Division of Materials Science and Engineering, Hanyang University, Seoul 04763, Republic of Korea
S
Sun-Jin Choi
Division of Materials Science and Engineering, Hanyang University, Seoul 04763, Republic of Korea
J
Jisu Bae
Division of Materials Science and Engineering, Hanyang University, Seoul 04763, Republic of Korea
C
Chihoon Lee
Division of Materials Science and Engineering, Hanyang University, Seoul 04763, Republic of Korea
Y
Yunseo Kim
Division of Materials Science and Engineering, Hanyang University, Seoul 04763, Republic of Korea
J
Jinsung Park
Division of Materials Science and Engineering, Hanyang University, Seoul 04763, Republic of Korea
Hyunsouk Cho
Hyunsouk Cho
Assistant professor, Ajou university, Korea
Artificial Intelligence