Chemist-X: Large Language Model-empowered Agent for Reaction Condition Recommendation in Chemical Synthesis

📅 2023-11-16
📈 Citations: 8
Influential: 1
📄 PDF
🤖 AI Summary
This study addresses the longstanding bottleneck in reaction condition optimization (RCO) for chemical synthesis—its heavy reliance on empirical knowledge and trial-and-error. We propose the first end-to-end, autonomous AI agent framework for RCO. Methodologically, we introduce a novel four-layer synergistic paradigm: Retrieval-Augmented Generation (RAG) dynamically retrieves domain-specific chemical knowledge; a large language model (LLM) generates executable, controllable experimental protocols; a custom-built chemical computer-aided design (CAD) tool translates protocols into experimentally feasible configurations; and a robotic system executes fully automated wet-lab experiments in closed-loop. Validated across diverse organic reactions, the framework achieves fully autonomous operation—from condition recommendation and protocol generation to physical validation—without human intervention. It significantly outperforms conventional RCO approaches in accuracy and represents the first demonstration of a “self-driving laboratory.”
📝 Abstract
Recent AI research plots a promising future of automatic chemical reactions within the chemistry society. This study proposes Chemist-X, a comprehensive AI agent that automates the reaction condition optimization (RCO) task in chemical synthesis with retrieval-augmented generation (RAG) technology and AI-controlled wet-lab experiment executions. To begin with, as an emulation on how chemical experts solve the RCO task, Chemist-X utilizes a novel RAG scheme to interrogate available molecular and literature databases to narrow the searching space for later processing. The agent then leverages a computer-aided design (CAD) tool we have developed through a large language model (LLM) supervised programming interface. With updated chemical knowledge obtained via RAG, as well as the ability in using CAD tools, our agent significantly outperforms conventional RCO AIs confined to the fixed knowledge within its training data. Finally, Chemist-X interacts with the physical world through an automated robotic system, which can validate the suggested chemical reaction condition without human interventions. The control of the robotic system was achieved with a novel algorithm we have developed for the equipment, which relies on LLMs for reliable script generation. Results of our automatic wet-lab experiments, achieved by fully LLM-supervised end-to-end operation with no human in the lope, prove Chemist-X's ability in self-driving laboratories.
Problem

Research questions and friction points this paper is trying to address.

Automates reaction condition optimization in chemical synthesis
Leverages retrieval-augmented generation for molecular database interrogation
Enables autonomous wet-lab experiments via LLM-controlled robotic systems
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses RAG for molecular database interrogation
LLM-supervised CAD tool for chemical design
Automated robotic system for lab validation
Kexin Chen
Kexin Chen
CUHK
LLM/VLMsAI AgentMulti-modality LearningAI for Science
J
Junyou Li
Zhejiang Lab, Hangzhou, China.
Kunyi Wang
Kunyi Wang
UBC; KAUST
VisionGraphics
Yuyang Du
Yuyang Du
Department of Information Engineering, CUHK
Generative AIsWireless CommunicationNetworking
Jiahui Yu
Jiahui Yu
Research Scientist, OpenAI
Artificial Intelligence
J
Jiamin Lu
Institute of Intelligent Chemical Manufacturing and iChemFoundry Platform, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, China.; Institute of Microanalytical Systems, Department of Chemistry, Zhejiang University, Hangzhou, China.
Lanqing Li
Lanqing Li
Zhejiang Lab, The Chinese University of Hong Kong
Machine LearningAI for ScienceReinforcement LearningAI for Drug Discovery
Jiezhong Qiu
Jiezhong Qiu
Zhejiang University - Zhejiang Lab Hundred Talents Program Researcher
Data MiningSocial Network AnalysisNetwork EmbeddingGraph Neural Networks
J
Jianzhang Pan
Institute of Intelligent Chemical Manufacturing and iChemFoundry Platform, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, China.; Institute of Microanalytical Systems, Department of Chemistry, Zhejiang University, Hangzhou, China.
Y
Yi Huang
Institute of Intelligent Chemical Manufacturing and iChemFoundry Platform, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, China.; Institute of Microanalytical Systems, Department of Chemistry, Zhejiang University, Hangzhou, China.
Q
Qun Fang
Institute of Intelligent Chemical Manufacturing and iChemFoundry Platform, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, China.; Institute of Microanalytical Systems, Department of Chemistry, Zhejiang University, Hangzhou, China.
P
P. Heng
Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong SAR, China.
G
Guangyong Chen
Zhejiang Lab, Hangzhou, China.