ChemBOMAS: Accelerated BO in Chemistry with LLM-Enhanced Multi-Agent System

📅 2025-09-10
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Chemical Bayesian optimization (BO) suffers from low search efficiency due to sparse experimental data and complex reaction mechanisms. To address this, we propose an LLM-augmented multi-agent collaborative optimization framework. Our method leverages large language models (LLMs) for knowledge-guided decomposition of the reaction space, enabling synergistic coarse-grained pathway planning and fine-grained parameter optimization; additionally, it generates high-fidelity pseudo-data grounded in chemical mechanistic principles to alleviate data scarcity. By integrating knowledge-driven reasoning with data-driven modeling, the framework significantly outperforms conventional BO across multiple benchmark tasks. Wet-lab validation demonstrates a 96% success rate in achieving target reaction outcomes—over sixfold higher than the domain expert baseline (15%)—establishing a scalable new paradigm for high-throughput chemical optimization.

Technology Category

Application Category

📝 Abstract
The efficiency of Bayesian optimization (BO) in chemistry is often hindered by sparse experimental data and complex reaction mechanisms. To overcome these limitations, we introduce ChemBOMAS, a new framework named LLM-Enhanced Multi-Agent System for accelerating BO in chemistry. ChemBOMAS's optimization process is enhanced by LLMs and synergistically employs two strategies: knowledge-driven coarse-grained optimization and data-driven fine-grained optimization. First, in the knowledge-driven coarse-grained optimization stage, LLMs intelligently decompose the vast search space by reasoning over existing chemical knowledge to identify promising candidate regions. Subsequently, in the data-driven fine-grained optimization stage, LLMs enhance the BO process within these candidate regions by generating pseudo-data points, thereby improving data utilization efficiency and accelerating convergence. Benchmark evaluations** further confirm that ChemBOMAS significantly enhances optimization effectiveness and efficiency compared to various BO algorithms. Importantly, the practical utility of ChemBOMAS was validated through wet-lab experiments conducted under pharmaceutical industry protocols, targeting conditional optimization for a previously unreported and challenging chemical reaction. In the wet experiment, ChemBOMAS achieved an optimal objective value of 96%. This was substantially higher than the 15% achieved by domain experts. This real-world success, together with strong performance on benchmark evaluations, highlights ChemBOMAS as a powerful tool to accelerate chemical discovery.
Problem

Research questions and friction points this paper is trying to address.

Accelerating Bayesian optimization for chemical reactions
Overcoming sparse data and complex reaction mechanisms
Enhancing optimization with LLM-driven multi-agent strategies
Innovation

Methods, ideas, or system contributions that make the work stand out.

LLM-Enhanced Multi-Agent System for chemical optimization
Knowledge-driven coarse-grained optimization using chemical reasoning
Data-driven fine-grained optimization with pseudo-data generation
D
Dong Han
Shanghai Artificial Intelligence Laboratory, Shanghai, China
Z
Zhehong Ai
Shanghai Artificial Intelligence Laboratory, Shanghai, China
P
Pengxiang Cai
Shanghai Artificial Intelligence Laboratory, Shanghai, China
Shuzhou Sun
Shuzhou Sun
University of Oulu
Deep learningComputer visionCausal inference
S
Shanya Lu
Tongji University, Shanghai, China
Jianpeng Chen
Jianpeng Chen
Shanghai Artificial Intelligence Laboratory, Shanghai, China
Ben Gao
Ben Gao
Wuhan University
AI4SicenceMLAsymmetric CatalysisDistillationASSB
L
Lingli Ge
Shanghai Artificial Intelligence Laboratory, Shanghai, China
W
Weida Wang
Shanghai Artificial Intelligence Laboratory, Shanghai, China
Xiangxin Zhou
Xiangxin Zhou
Unknown affiliation
Xihui Liu
Xihui Liu
University of Hong Kong, UC Berkeley, CUHK, Tsinghua University
Computer VisionDeep Learning
Mao Su
Mao Su
Shanghai AI Laboratory
PhysicsAI
W
Wanli Ouyang
Shanghai Artificial Intelligence Laboratory, Shanghai, China
Lei Bai
Lei Bai
Shanghai AI Laboratory
Foundation ModelScience IntelligenceMulti-Agent SystemAutonomous Discovery
Dongzhan Zhou
Dongzhan Zhou
Researcher at Shanghai AI Lab
AI4Sciencecomputer visiondeep learning
T
Tao Xu
Tongji University, Shanghai, China
Yuqiang Li
Yuqiang Li
Central South University
Internal Combustion EngineCombustionEmissionsMechansim
S
Shufei Zhang
Shanghai Artificial Intelligence Laboratory, Shanghai, China