Fast and scalable retrosynthetic planning with a transformer neural network and speculative beam search

📅 2025-08-02
📈 Citations: 0
Influential: 0
📄 PDF

career value

212K/year
🤖 AI Summary
AI-driven computer-aided synthetic planning (CASP) suffers from high inference latency, hindering high-throughput synthesizability screening in de novo drug design. To address this, we propose a low-latency, multi-step retrosynthetic planning acceleration framework tailored for Transformer architectures. Our method innovatively integrates speculative beam search with a scalable Medusa draft strategy, enabling parallelized, low-latency inference atop a SMILES-to-SMILES single-step predictor. Experiments under strict time budgets of several seconds demonstrate that our approach improves molecular solution coverage by 26%–86% over baseline methods, substantially enhancing retrosynthetic throughput. This work represents the first systematic application of speculative decoding to CASP, establishing an efficient and practical pathway for real-time, high-throughput assessment of drug molecule synthesizability.

Technology Category

Application Category

📝 Abstract
AI-based computer-aided synthesis planning (CASP) systems are in demand as components of AI-driven drug discovery workflows. However, the high latency of such CASP systems limits their utility for high-throughput synthesizability screening in de novo drug design. We propose a method for accelerating multi-step synthesis planning systems that rely on SMILES-to-SMILES transformers as single-step retrosynthesis models. Our approach reduces the latency of SMILES-to-SMILES transformers powering multi-step synthesis planning in AiZynthFinder through speculative beam search combined with a scalable drafting strategy called Medusa. Replacing standard beam search with our approach allows the CASP system to solve 26% to 86% more molecules under the same time constraints of several seconds. Our method brings AI-based CASP systems closer to meeting the strict latency requirements of high-throughput synthesizability screening and improving general user experience.
Problem

Research questions and friction points this paper is trying to address.

Reducing high latency in AI-based synthesis planning systems
Accelerating multi-step retrosynthesis with transformer models
Meeting strict latency needs for high-throughput drug screening
Innovation

Methods, ideas, or system contributions that make the work stand out.

Transformer neural network for retrosynthetic planning
Speculative beam search to reduce latency
Scalable Medusa drafting strategy
🔎 Similar Papers
No similar papers found.
💼 Related Jobs
AI Data Engineer--LLMs / Agentic Systems
Pfizer
The annual base salary for this position ranges from $106,000.00 to $176,600.00. In addition, this position is eligible for participation in Pfizer’s Global Performance Plan with a bonus target of 15.0% of the base salary and eligibility to participate in our share based long term incentive program. We offer comprehensive and generous benefits and programs to help our colleagues lead healthy lives and to support each of life’s moments. Benefits offered include a 401(k) plan with Pfizer Matching Contributions and an additional Pfizer Retirement Savings Contribution, paid vacation, holiday and personal days, paid caregiver/parental and medical leave, and health benefits to include medical, prescription drug, dental and vision coverage. Learn more at Pfizer Candidate Site – U.S. Benefits | (uscandidates.mypfizerbenefits.com). Pfizer compensation structures and benefit packages are aligned based on the location of hire. The United States salary range provided does not apply to Tampa, FL or any location outside of the United States. Relocation assistance may be available based on business needs and/or eligibility.
United States - Massachusetts - Cambridge
Mikhail Andronov
Mikhail Andronov
PhD Student, IDSIA/USI/SUPSI
CheminformaticsMachine LearningML for ChemistryAIDrug Discovery
Natalia Andronova
Natalia Andronova
Unknown affiliation
CheminformaticsAIphysics
M
Michael Wand
SUPSI, IDSIA, Lugano, Switzerland.; Institute for Digital Technologies for Personalized Healthcare, SUPSI, Lugano, Switzerland.
J
Jürgen Schmidhuber
Center for Generative AI, KAUST, Thuwal, Saudi Arabia.
Djork-Arné Clevert
Djork-Arné Clevert
Pfizer, VP, Machine Learning Research
Drug DiscoveryMachine LearningDeep LearningComputational ChemistryComputational Biology