MADD: Multi-Agent Drug Discovery Orchestra

📅 2025-11-11
🏛️ Conference on Empirical Methods in Natural Language Processing
📈 Citations: 0
Influential: 0
📄 PDF

career value

210K/year
🤖 AI Summary
In early-stage drug discovery, hit identification remains costly, and AI tools exhibit high usability barriers for wet-lab researchers. This work introduces the first multi-agent collaborative system tailored for drug discovery, which automatically translates natural-language instructions into end-to-end molecular generation and virtual screening workflows by integrating large language models (LLMs) with domain-specific molecular modeling tools—including molecular docking, generative modeling, and property-based filtering. Key contributions include: (1) AI-first de novo design against five novel therapeutic targets; (2) construction of the first benchmark dataset comprising over 3 million query–molecule pairs; and (3) statistically significant performance gains across seven evaluation tasks versus state-of-the-art LLM-based approaches, culminating in the discovery and public release of multiple validated hit candidates. The system substantially enhances accessibility and practical utility of AI-driven drug discovery for experimental biologists and medicinal chemists.

Technology Category

Application Category

📝 Abstract
Hit identification is a central challenge in early drug discovery, traditionally requiring substantial experimental resources. Recent advances in artificial intelligence, particularly large language models (LLMs), have enabled virtual screening methods that reduce costs and improve efficiency. However, the growing complexity of these tools has limited their accessibility to wet-lab researchers. Multi-agent systems offer a promising solution by combining the interpretability of LLMs with the precision of specialized models and tools. In this work, we present MADD, a multi-agent system that builds and executes customized hit identification pipelines from natural language queries. MADD employs four coordinated agents to handle key subtasks in de novo compound generation and screening. We evaluate MADD across seven drug discovery cases and demonstrate its superior performance compared to existing LLM-based solutions. Using MADD, we pioneer the application of AI-first drug design to five biological targets and release the identified hit molecules. Finally, we introduce a new benchmark of query-molecule pairs and docking scores for over three million compounds to contribute to the agentic future of drug design.
Problem

Research questions and friction points this paper is trying to address.

Addresses hit identification challenges in early drug discovery
Improves accessibility of complex AI tools for wet-lab researchers
Enhances virtual screening through multi-agent orchestrated pipelines
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-agent system orchestrates customized drug discovery pipelines
Four coordinated agents handle compound generation and screening
Natural language queries build and execute hit identification workflows
💼 Related Jobs
AI Data Engineer--LLMs / Agentic Systems
Pfizer
The annual base salary for this position ranges from $106,000.00 to $176,600.00. In addition, this position is eligible for participation in Pfizer’s Global Performance Plan with a bonus target of 15.0% of the base salary and eligibility to participate in our share based long term incentive program. We offer comprehensive and generous benefits and programs to help our colleagues lead healthy lives and to support each of life’s moments. Benefits offered include a 401(k) plan with Pfizer Matching Contributions and an additional Pfizer Retirement Savings Contribution, paid vacation, holiday and personal days, paid caregiver/parental and medical leave, and health benefits to include medical, prescription drug, dental and vision coverage. Learn more at Pfizer Candidate Site – U.S. Benefits | (uscandidates.mypfizerbenefits.com). Pfizer compensation structures and benefit packages are aligned based on the location of hire. The United States salary range provided does not apply to Tampa, FL or any location outside of the United States. Relocation assistance may be available based on business needs and/or eligibility.
United States - Massachusetts - Cambridge
G
Gleb V. Solovev
ITMO University, Saint Petersburg, Russia
A
A. B. Zhidkovskaya
ITMO University, Saint Petersburg, Russia
A
Anastasia Orlova
ITMO University, Saint Petersburg, Russia
Nina Gubina
Nina Gubina
ITMO University
computer aided drug designapplied artificial intelligencecheminformatics
A
Anastasia Vepreva
ITMO University, Saint Petersburg, Russia
R
Rodion Golovinskii
ITMO University, Saint Petersburg, Russia
I
Ilya Tonkii
ITMO University, Saint Petersburg, Russia
I
Ivan Dubrovsky
ITMO University, Saint Petersburg, Russia
I
Ivan Gurev
ITMO University, Saint Petersburg, Russia
D
Dmitry Gilemkhanov
ITMO University, Saint Petersburg, Russia
D
Denis Chistiakov
ITMO University, Saint Petersburg, Russia
T
Timur A. Aliev
ITMO University, Saint Petersburg, Russia
I
Ivan Poddiakov
Sber AI Lab, Moscow, Russia
Galina Zubkova
Galina Zubkova
Sber AI Lab, Moscow, Russia
E
E. Skorb
ITMO University, Saint Petersburg, Russia
V
Vladimir Vinogradov
ITMO University, Saint Petersburg, Russia
Alexander Boukhanovsky
Alexander Boukhanovsky
ITMO University, Saint Petersburg, Russia
N
Nikolay O. Nikitin
ITMO University, Saint Petersburg, Russia
A
A. Dmitrenko
ITMO University, Saint Petersburg, Russia; D ONE AG, Zurich, Switzerland
A
A. Kalyuzhnaya
ITMO University, Saint Petersburg, Russia
Andrey Savchenko
Andrey Savchenko
Sber AI Lab; HSE University - Nizhny Novgorod
Computer VisionPattern RecognitionMachine LearningSpeech ProcessingImage Processing