FROGENT: An End-to-End Full-process Drug Design Agent

📅 2025-08-14
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In drug discovery, AI tools are fragmented across isolated platforms with incompatible interfaces and scripting environments, leading to inefficient, redundant workflows. To address this, we propose FROGENT, an end-to-end intelligent agent that—uniquely—integrates, within a unified framework, a dynamic biochemical knowledge base, a heterogeneous toolkit (encompassing molecular generation, virtual screening, synthetic route planning, etc.), and domain-specific AI models. Leveraging large language models (LLMs) and the Model Context Protocol (MCP), FROGENT achieves task understanding, adaptive workflow orchestration, and autonomous decision-making. Evaluated on eight benchmark tasks, FROGENT significantly outperforms existing approaches: it achieves a target–ligand hit-rate retrieval three times higher than the strongest baseline, improves protein–small-molecule interaction prediction accuracy by 100%, and consistently surpasses Qwen3-32B and GPT-4o across all metrics.

Technology Category

Application Category

📝 Abstract
Powerful AI tools for drug discovery reside in isolated web apps, desktop programs, and code libraries. Such fragmentation forces scientists to manage incompatible interfaces and specialized scripts, which can be a cumbersome and repetitive process. To address this issue, a Full-pROcess druG dEsign ageNT, named FROGENT, has been proposed. Specifically, FROGENT utilizes a Large Language Model and the Model Context Protocol to integrate multiple dynamic biochemical databases, extensible tool libraries, and task-specific AI models. This agentic framework allows FROGENT to execute complicated drug discovery workflows dynamically, including component tasks such as target identification, molecule generation and retrosynthetic planning. FROGENT has been evaluated on eight benchmarks that cover various aspects of drug discovery, such as knowledge retrieval, property prediction, virtual screening, mechanistic analysis, molecular design, and synthesis. It was compared against six increasingly advanced ReAct-style agents that support code execution and literature searches. Empirical results demonstrated that FROGENT triples the best baseline performance in hit-finding and doubles it in interaction profiling, significantly outperforming both the open-source model Qwen3-32B and the commercial model GPT-4o. In addition, real-world cases have been utilized to validate the practicability and generalization of FROGENT. This development suggests that streamlining the agentic drug discovery pipeline can significantly enhance researcher productivity.
Problem

Research questions and friction points this paper is trying to address.

Integrates fragmented AI tools for drug discovery
Dynamically executes complex drug design workflows
Enhances performance in hit-finding and interaction profiling
Innovation

Methods, ideas, or system contributions that make the work stand out.

Integrates dynamic biochemical databases and AI models
Uses Large Language Model and Model Context Protocol
Executes complex drug discovery workflows dynamically
🔎 Similar Papers
No similar papers found.
Q
Qihua Pan
School of Artificial Intelligence, Shenzhen University, Shenzhen, China
D
Dong Xu
School of Artificial Intelligence, Shenzhen University, Shenzhen, China
Jenna Xinyi Yao
Jenna Xinyi Yao
University of California, San Diego
AI for science
Lijia Ma
Lijia Ma
Shenzhen University & Nanyang Technological University & Hong Kong Baptist University & Xidian
Artificial intelligenceOptimizationData miningComplex networksMachine Learning
Zexuan Zhu
Zexuan Zhu
Shenzhen University
Evolutionary ComputationMemetic ComputingBioinformaticsMachine Learning
J
Junkai Ji
School of Artificial Intelligence, Shenzhen University, Shenzhen, China