Fast and scalable retrosynthetic planning with a transformer neural network and speculative beam search

๐Ÿ“… 2025-08-02
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
AI-driven computer-aided synthetic planning (CASP) suffers from high inference latency, hindering high-throughput synthesizability screening in de novo drug design. To address this, we propose a low-latency, multi-step retrosynthetic planning acceleration framework tailored for Transformer architectures. Our method innovatively integrates speculative beam search with a scalable Medusa draft strategy, enabling parallelized, low-latency inference atop a SMILES-to-SMILES single-step predictor. Experiments under strict time budgets of several seconds demonstrate that our approach improves molecular solution coverage by 26%โ€“86% over baseline methods, substantially enhancing retrosynthetic throughput. This work represents the first systematic application of speculative decoding to CASP, establishing an efficient and practical pathway for real-time, high-throughput assessment of drug molecule synthesizability.

Technology Category

Application Category

๐Ÿ“ Abstract
AI-based computer-aided synthesis planning (CASP) systems are in demand as components of AI-driven drug discovery workflows. However, the high latency of such CASP systems limits their utility for high-throughput synthesizability screening in de novo drug design. We propose a method for accelerating multi-step synthesis planning systems that rely on SMILES-to-SMILES transformers as single-step retrosynthesis models. Our approach reduces the latency of SMILES-to-SMILES transformers powering multi-step synthesis planning in AiZynthFinder through speculative beam search combined with a scalable drafting strategy called Medusa. Replacing standard beam search with our approach allows the CASP system to solve 26% to 86% more molecules under the same time constraints of several seconds. Our method brings AI-based CASP systems closer to meeting the strict latency requirements of high-throughput synthesizability screening and improving general user experience.
Problem

Research questions and friction points this paper is trying to address.

Reducing high latency in AI-based synthesis planning systems
Accelerating multi-step retrosynthesis with transformer models
Meeting strict latency needs for high-throughput drug screening
Innovation

Methods, ideas, or system contributions that make the work stand out.

Transformer neural network for retrosynthetic planning
Speculative beam search to reduce latency
Scalable Medusa drafting strategy
๐Ÿ”Ž Similar Papers
No similar papers found.
Mikhail Andronov
Mikhail Andronov
PhD Student, IDSIA/USI/SUPSI
CheminformaticsMachine LearningML for ChemistryAIDrug Discovery
Natalia Andronova
Natalia Andronova
Unknown affiliation
CheminformaticsAIphysics
M
Michael Wand
SUPSI, IDSIA, Lugano, Switzerland.; Institute for Digital Technologies for Personalized Healthcare, SUPSI, Lugano, Switzerland.
J
Jรผrgen Schmidhuber
Center for Generative AI, KAUST, Thuwal, Saudi Arabia.
Djork-Arnรฉ Clevert
Djork-Arnรฉ Clevert
Pfizer, VP, Machine Learning Research
Drug DiscoveryMachine LearningDeep LearningComputational ChemistryComputational Biology