Text-Augmented Multimodal LLMs for Chemical Reaction Condition Recommendation

📅 2024-07-21
🏛️ arXiv.org
📈 Citations: 1
Influential: 1
📄 PDF
🤖 AI Summary
To address the long-standing challenges of high experimental cost and low throughput efficiency in reaction condition (RC) optimization for chemical and pharmaceutical applications, this paper introduces MM-RCR, a multimodal large language model. Methodologically, MM-RCR is the first to unify SMILES strings, reaction graphs, and domain-specific textual descriptions into a coherent multimodal reaction representation. We construct a high-quality instruction-tuning dataset comprising 1.2 million Q&A pairs and integrate cross-modal alignment learning with supervised instruction fine-tuning—significantly enhancing generalization under out-of-distribution (OOD) settings and high-throughput experimentation (HTE). Evaluated on two public benchmarks, MM-RCR achieves state-of-the-art performance, demonstrating superior accuracy in RC recommendation and accelerated screening. This work establishes a novel paradigm for automated synthetic route design.

Technology Category

Application Category

📝 Abstract
High-throughput reaction condition (RC) screening is fundamental to chemical synthesis. However, current RC screening suffers from laborious and costly trial-and-error workflows. Traditional computer-aided synthesis planning (CASP) tools fail to find suitable RCs due to data sparsity and inadequate reaction representations. Nowadays, large language models (LLMs) are capable of tackling chemistry-related problems, such as molecule design, and chemical logic Q&A tasks. However, LLMs have not yet achieved accurate predictions of chemical reaction conditions. Here, we present MM-RCR, a text-augmented multimodal LLM that learns a unified reaction representation from SMILES, reaction graphs, and textual corpus for chemical reaction recommendation (RCR). To train MM-RCR, we construct 1.2 million pair-wised Q&A instruction datasets. Our experimental results demonstrate that MM-RCR achieves state-of-the-art performance on two open benchmark datasets and exhibits strong generalization capabilities on out-of-domain (OOD) and High-Throughput Experimentation (HTE) datasets. MM-RCR has the potential to accelerate high-throughput condition screening in chemical synthesis.
Problem

Research questions and friction points this paper is trying to address.

Identifying broadly applicable reaction conditions across diverse chemical substrates
Developing a universal approach for reliable discovery of effective reaction conditions
Accelerating high-throughput condition screening to reduce labor-intensive trial-and-error
Innovation

Methods, ideas, or system contributions that make the work stand out.

Text-augmented multimodal LLM for chemical condition recommendation
Unified representation learning from text, SMILES, and reaction graphs
Task-specific dialogue and condition generation for synthesis optimization
🔎 Similar Papers
No similar papers found.
Y
Yu Zhang
MoE Key Laboratory of Artificial Intelligence, AI Institute, Shanghai Jiao Tong University
R
Ruijie Yu
MoE Key Laboratory of Artificial Intelligence, AI Institute, Shanghai Jiao Tong University
K
Kaipeng Zeng
MoE Key Laboratory of Artificial Intelligence, AI Institute, Shanghai Jiao Tong University
D
Ding Li
MoE Key Laboratory of Artificial Intelligence, AI Institute, Shanghai Jiao Tong University
F
Feng Zhu
Frontiers Science Center for Transformative Molecules (FSCTM), Shanghai Jiao Tong University
X
Xiaokang Yang
MoE Key Laboratory of Artificial Intelligence, AI Institute, Shanghai Jiao Tong University
Yaohui Jin
Yaohui Jin
Shanghai Jiao Tong University
Y
Yanyan Xu
MoE Key Laboratory of Artificial Intelligence, AI Institute, Shanghai Jiao Tong University