A large-scale nanocrystal database with aligned synthesis and properties enabling generative inverse design

📅 2026-01-04
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the long-standing reliance on trial-and-error in nanocrystal synthesis, primarily due to the complex relationship between synthesis parameters and material properties, compounded by a lack of high-quality, aligned data. To overcome this, the authors constructed a Nanocrystal Synthesis–Property (NSP) database comprising 160,000 records and introduced a human–AI collaborative inverse design paradigm. They developed NanoExtractor, a large language model enhanced with reinforcement strategies, to automatically extract structured synthesis protocols and performance metrics from scientific literature, achieving an expert-evaluated weighted accuracy of 88%—substantially outperforming existing models. Building on this, they trained a generative model, NanoDesigner, for inverse synthesis planning. The framework successfully designed viable synthetic routes for PbSe and MgF₂ nanocrystals; notably, an experimentally validated MgF₂ route guided by non-intuitive precursor ratios effectively suppressed unwanted byproducts.

Technology Category

Application Category

📝 Abstract
The synthesis of nanocrystals has been highly dependent on trial-and-error, due to the complex correlation between synthesis parameters and physicochemical properties. Although deep learning offers a potential methodology to achieve generative inverse design, it is still hindered by the scarcity of high-quality datasets that align nanocrystal synthesis routes with their properties. Here, we present the construction of a large-scale, aligned Nanocrystal Synthesis-Property (NSP) database and demonstrate its capability for generative inverse design. To extract structured synthesis routes and their corresponding product properties from literature, we develop NanoExtractor, a large language model (LLM) enhanced by well-designed augmentation strategies. NanoExtractor is validated against human experts, achieving a weighted average score of 88% on the test set, significantly outperforming chemistry-specialized (3%) and general-purpose LLMs (38%). The resulting NSP database contains nearly 160,000 aligned entries and serves as training data for our NanoDesigner, an LLM for inverse synthesis design. The generative capability of NanoDesigner is validated through the successful design of viable synthesis routes for both well-established PbSe nanocrystals and rarely reported MgF2 nanocrystals. Notably, the model recommends a counter-intuitive, non-stoichiometric precursor ratio (1:1) for MgF2 nanocrystals, which is experimentally confirmed as critical for suppressing byproducts. Our work bridges the gap between unstructured literature and data-driven synthesis, and also establishes a powerful human-AI collaborative paradigm for accelerating nanocrystal discovery.
Problem

Research questions and friction points this paper is trying to address.

nanocrystal synthesis
inverse design
data scarcity
synthesis-property alignment
trial-and-error
Innovation

Methods, ideas, or system contributions that make the work stand out.

nanocrystal inverse design
large language model (LLM)
synthesis-property alignment
generative AI for materials
NanoExtractor
🔎 Similar Papers
K
Kai Gu
MIIT Key Laboratory for Low-Dimensional Quantum Structure and Devices, School of Materials Sciences & Engineering, Beijing Institute of Technology, Beijing 100081, China
Yingping Liang
Yingping Liang
2022~2028 Ph.D. student at Beijing Institute of Technology
3D Vision
S
Senliang Peng
MIIT Key Laboratory for Low-Dimensional Quantum Structure and Devices, School of Materials Sciences & Engineering, Beijing Institute of Technology, Beijing 100081, China
A
Aotian Guo
MIIT Key Laboratory for Low-Dimensional Quantum Structure and Devices, School of Materials Sciences & Engineering, Beijing Institute of Technology, Beijing 100081, China
H
Haizheng Zhong
MIIT Key Laboratory for Low-Dimensional Quantum Structure and Devices, School of Materials Sciences & Engineering, Beijing Institute of Technology, Beijing 100081, China
Ying Fu
Ying Fu
Beijing Institute of Technology
Computer Vision