MolClaw: An Autonomous Agent with Hierarchical Skills for Drug Molecule Evaluation, Screening, and Optimization

📅 2026-04-02
🏛️ bioRxiv
📈 Citations: 0
Influential: 0
📄 PDF

career value

216K/year
🤖 AI Summary
Current AI agents struggle to coordinate multi-step, multi-tool workflows in complex drug molecule screening and optimization, limiting their performance. This work proposes the first three-tiered hierarchical skill architecture tailored for drug discovery—comprising tool, workflow, and domain layers—that integrates over 30 computational chemistry and AI tools and embeds domain-knowledge-driven mechanisms for planning, validation, and reflection. Building upon this framework, the authors introduce MolBench, a comprehensive benchmark featuring tasks requiring 8 to 50+ tool invocations. Experiments demonstrate that the proposed approach achieves state-of-the-art performance across MolBench, and ablation studies confirm that its superiority stems from its structured workflow coordination capability, revealing such coordination as a critical bottleneck in AI-driven drug discovery.
📝 Abstract
Computational drug discovery, particularly the complex workflows of drug molecule screening and optimization, requires orchestrating dozens of specialized tools in multi-step workflows, yet current AI agents struggle to maintain robust performance and consistently underperform in these high-complexity scenarios. Here we present MolClaw, an autonomous agent that leads drug molecule evaluation, screening, and optimization. It unifies over 30 specialized domain resources through a three-tier hierarchical skill architecture (70 skills in total) that facilitates agent long-term interaction at runtime: tool-level skills standardize atomic operations, workflow-level skills compose them into validated pipelines with quality check and reflection, and a discipline-level skill supplies scientific principles governing planning and verification across all scenarios in the field. Additionally, we introduce MolBench, a benchmark comprising molecular screening, optimization, and end-to-end discovery challenges spanning 8 to 50+ sequential tool calls. MolClaw achieves state-of-the-art performance across all metrics, and ablation studies confirm that gains concentrate on tasks that demand structured workflows while vanishing on those solvable with ad hoc scripting, establishing workflow orchestration competence as the primary capability bottleneck for AI-driven drug discovery.
Problem

Research questions and friction points this paper is trying to address.

computational drug discovery
molecule screening
molecule optimization
AI agent
workflow orchestration
Innovation

Methods, ideas, or system contributions that make the work stand out.

hierarchical skill architecture
workflow orchestration
autonomous agent
computational drug discovery
MolBench
L
Lisheng Zhang
Peking University Health Science Center, Peking University, Beijing, China
L
Lilong Wang
Shanghai AI Laboratory, Shanghai, China
Xiangyu Sun
Xiangyu Sun
Huazhong University of Science and Technology, Sungkyunkwan University
3D Gaussian Splatting3D Generative modelNeRF
W
Wei Tang
Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
H
Haoyang Su
Shanghai AI Laboratory, Shanghai, China
Yuehui Qian
Yuehui Qian
University of Maryland, College Park
Terrain analysisSpatial data structureRemote sensingMachine LearningApache Spark
Q
Qikui Yang
Peking University Health Science Center, Peking University, Beijing, China
Q
Qingsong Li
Peking University Health Science Center, Peking University, Beijing, China
Zhenyu Tang
Zhenyu Tang
Shanghai Jiao Tong University
Computer Vision
H
Haoran Sun
Shanghai AI Laboratory, Shanghai, China
Y
Yingnan Han
Shanghai AI Laboratory, Shanghai, China
Yankai Jiang
Yankai Jiang
Shanghai AI Laboratory
Multimodal LLMVision-Language PretrainingAI for Science
W
Wenjie Lou
Shanghai AI Laboratory, Shanghai, China
B
Bowen Zhou
Shanghai AI Laboratory, Shanghai, China
Xiaosong Wang
Xiaosong Wang
Shanghai AI Laboratory
Medical Image AnalysisComputer VisionVision and Language
Lei Bai
Lei Bai
Shanghai AI Laboratory
Foundation ModelScience IntelligenceMulti-Agent SystemAutonomous Discovery
Z
Zhengwei Xie
Peking University Health Science Center, Peking University, Beijing, China