EvoMaster: A Foundational Agent Framework for Building Evolving Autonomous Scientific Agents at Scale

📅 2026-04-19
📈 Citations: 0
Influential: 0
📄 PDF

career value

241K/year
🤖 AI Summary
Current agent frameworks are predominantly static and domain-specific, lacking the capacity for continual learning and evolution, thereby hindering their applicability to large-scale autonomous scientific discovery. This work proposes SciMaster—the first cross-domain, extensible foundational framework for self-evolving scientific agents—leveraging a large language model–driven mechanism for continuous self-improvement. SciMaster enables iterative hypothesis generation, self-critique, and cross-task knowledge accumulation, allowing high-capability research agents to be deployed with only approximately 100 lines of code. The framework achieves state-of-the-art performance on four established benchmarks (41.1%, 75.8%, 73.3%, and 53.3%), outperforming the general-purpose baseline OpenClaw by 159% to 316%, thereby strongly validating its effectiveness and generalization capability.

Technology Category

Application Category

📝 Abstract
The convergence of large language models and agents is catalyzing a new era of scientific discovery: Agentic Science. While the scientific method is inherently iterative, existing agent frameworks are predominantly static, narrowly scoped, and lack the capacity to learn from trial and error. To bridge this gap, we present EvoMaster, a foundational evolving agent framework engineered specifically for Agentic Science at Scale. Driven by the core principle of continuous self-evolution, EvoMaster empowers agents to iteratively refine hypotheses, self-critique, and progressively accumulate knowledge across experimental cycles, faithfully mirroring human scientific inquiry. Crucially, as a domain-agnostic base harness, EvoMaster is exceptionally easy to scale up -- enabling developers to build and deploy highly capable, self-evolving scientific agents for arbitrary disciplines in approximately 100 lines of code. Built upon EvoMaster, we incubated the SciMaster ecosystem across domains such as machine learning, physics, and general science. Evaluations on four authoritative benchmarks (Humanity's Last Exam, MLE-Bench Lite, BrowseComp, and FrontierScience) demonstrate that EvoMaster achieves state-of-the-art scores of 41.1%, 75.8%, 73.3%, and 53.3%, respectively. It comprehensively outperforms the general-purpose baseline OpenClaw with relative improvements ranging from +159% to +316%, robustly validating its efficacy and generality as the premier foundational framework for the next generation of autonomous scientific discovery. EvoMaster is available at https://github.com/sjtu-sai-agents/EvoMaster.
Problem

Research questions and friction points this paper is trying to address.

Agentic Science
autonomous scientific agents
self-evolution
iterative learning
scientific discovery
Innovation

Methods, ideas, or system contributions that make the work stand out.

evolving agents
agentic science
self-evolution
domain-agnostic framework
autonomous scientific discovery
Xinyu Zhu
Xinyu Zhu
Shanghai Jiaotong university
Y
Yuzhu Cai
School of Artificial Intelligence, Shanghai Jiao Tong University
Zexi Liu
Zexi Liu
Shanghai Jiao Tong University
Cheng Wang
Cheng Wang
School of Mathematical Sciences, Shanghai Jiao Tong University
Large dimensional random matrixhigh dimensional data analysispopulation/sample covariance matrix
F
Fengyang Li
School of Artificial Intelligence, Shanghai Jiao Tong University
W
Wenkai Jin
School of Artificial Intelligence, Shanghai Jiao Tong University
W
Wanxu Liu
School of Artificial Intelligence, Shanghai Jiao Tong University
Z
Zehao Bing
School of Artificial Intelligence, Shanghai Jiao Tong University
B
Bingyang Zheng
School of Artificial Intelligence, Shanghai Jiao Tong University
Jingyi Chai
Jingyi Chai
Shanghai Jiao Tong University
Large Language ModelFederated Learning
Shuo Tang
Shuo Tang
Shanghai Jiao Tong University
Rui Ye
Rui Ye
Shanghai Jiao Tong University
Multi-AgentAgentsFederated Learning
Yuwen Du
Yuwen Du
Shanghai Jiao Tong University
Multi-AgentAutonomous Driving Simulation
Xianghe Pang
Xianghe Pang
Shanghai Jiao Tong University
LLM Agent
Yaxin Du
Yaxin Du
Shanghai Jiao Tong University
federated learningLLM agents
T
Tingjia Miao
School of Artificial Intelligence, Shanghai Jiao Tong University
Y
Yuzhi Zhang
SciLand
R
Ruoxue Liao
DP Technology
Z
Zhaohan Ding
DP Technology
Linfeng Zhang
Linfeng Zhang
DP Technology; AI for Science Institute
AI for Sciencemulti-scale modelingmolecular simulationdrug/materials design
Yanfeng Wang
Yanfeng Wang
Shanghai Jiao Tong University
Weinan E
Weinan E
Professor of Mathematics, Princeton University
applied mathematics
Siheng Chen
Siheng Chen
Shanghai Jiao Tong University
Collective intelligenceLLM agentgraph signal processingcollaborative perception