R&D-Agent: Automating Data-Driven AI Solution Building Through LLM-Powered Automated Research, Development, and Evolution

📅 2025-05-20

🤖 AI Summary
Current automated data science systems hit performance bottlenecks and rely heavily on domain expertise, and existing approaches struggle to balance efficiency and accuracy. To address this, we propose the first research-development dual-agent collaborative framework. A Researcher agent generates improvement strategies from performance feedback, while a Developer agent iteratively refines code guided by error signals. The two agents coordinate through two closed feedback loops, which enables parallel exploration along multiple paths, dynamic fusion of exploration trajectories, and aggregation of results. Built on large language models (LLMs), the framework combines feedback-driven code generation and correction with search-enhanced optimization. Evaluated on the MLE-Bench benchmark, it achieves state-of-the-art performance and ranks first on the Machine Learning Engineering Agent Leaderboard. The open-source implementation demonstrates strong cross-task generalization and practical engineering applicability.

📝 Abstract
Recent advances in AI and ML have transformed data science, yet increasing complexity and expertise requirements continue to hinder progress. While crowdsourcing platforms alleviate some challenges, high-level data science tasks remain labor-intensive and iterative. To overcome these limitations, we introduce R&D-Agent, a dual-agent framework for iterative exploration. The Researcher agent uses performance feedback to generate ideas, while the Developer agent refines code based on error feedback. By enabling multiple parallel exploration traces that merge and enhance one another, R&D-Agent narrows the gap between automated solutions and expert-level performance. Evaluated on MLE-Bench, R&D-Agent emerges as the top-performing machine learning engineering agent, demonstrating its potential to accelerate innovation and improve precision across diverse data science applications. We have open-sourced R&D-Agent on GitHub: https://github.com/microsoft/RD-Agent.
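
The idea of parallel exploration traces that merge can be illustrated with a toy sketch. Everything here is an assumption made for illustration: the objective `score`, the hill-climbing `explore` step, and the merge policy (every trace restarts from the current best solution) are hypothetical stand-ins, not the method implemented in R&D-Agent.

```python
import random
from concurrent.futures import ThreadPoolExecutor


def score(x: float) -> float:
    """Toy objective (hypothetical): higher is better, peak at x = 3."""
    return -(x - 3.0) ** 2


def explore(seed: int, start: float, steps: int) -> float:
    """One exploration trace: accept a random step only if it improves the score."""
    rng = random.Random(seed)
    best = start
    for _ in range(steps):
        candidate = best + rng.uniform(-1.0, 1.0)
        if score(candidate) > score(best):
            best = candidate
    return best


def run_traces(n_traces: int = 4, rounds: int = 3, steps: int = 20) -> float:
    """Run several traces in parallel and merge them after every round."""
    leader = 0.0
    bests = [leader] * n_traces
    for r in range(rounds):
        seeds = [r * n_traces + i for i in range(n_traces)]
        with ThreadPoolExecutor(max_workers=n_traces) as pool:
            bests = list(pool.map(explore, seeds, bests, [steps] * n_traces))
        # Assumed merge policy: all traces continue from the best solution so far.
        leader = max(bests, key=score)
        bests = [leader] * n_traces
    return leader


result = run_traces()
```

Because each trace only accepts improvements and the merge keeps the best trace, the final result never scores worse than the starting point. A real system would merge richer artifacts (code, ideas, evaluation logs) than a single number.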
Problem

Research questions and friction points this paper is trying to address.

Automating complex AI solution development to reduce expertise barriers
Enhancing iterative data science tasks with dual-agent collaboration
Bridging performance gap between automated and expert-level ML solutions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dual-agent framework for iterative exploration
Researcher generates ideas via performance feedback
Developer refines code based on error feedback
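
The two feedback loops listed above can be sketched as a nested loop. This is a minimal illustration under stated assumptions, not the R&D-Agent implementation: `rd_loop`, `Attempt`, and the four callables are hypothetical names, and the toy stubs stand in for LLM calls and real code execution.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple


@dataclass
class Attempt:
    idea: str
    code: str
    score: Optional[float]  # None when the code never ran successfully


def rd_loop(
    propose: Callable[[List[Attempt]], str],
    implement: Callable[[str], str],
    repair: Callable[[str, str], str],
    run: Callable[[str], Tuple[Optional[float], Optional[str]]],
    n_ideas: int = 3,
    max_fixes: int = 2,
) -> Tuple[Optional[Attempt], List[Attempt]]:
    """Outer loop: ideas driven by scores. Inner loop: fixes driven by errors."""
    history: List[Attempt] = []
    for _ in range(n_ideas):
        idea = propose(history)          # Researcher role: sees past ideas and scores
        code = implement(idea)           # Developer role: first draft
        score, error = run(code)
        fixes = 0
        while error is not None and fixes < max_fixes:
            code = repair(code, error)   # Developer role: refine using the error text
            score, error = run(code)
            fixes += 1
        history.append(Attempt(idea, code, score if error is None else None))
    scored = [a for a in history if a.score is not None]
    best = max(scored, key=lambda a: a.score) if scored else None
    return best, history


# Toy stubs (hypothetical): every first draft has a bug, one repair removes it,
# and the score simply equals the idea index.
def toy_propose(history: List[Attempt]) -> str:
    return f"idea-{len(history)}"


def toy_implement(idea: str) -> str:
    return f"BUG {idea}"


def toy_repair(code: str, error: str) -> str:
    return code.replace("BUG ", "")


def toy_run(code: str) -> Tuple[Optional[float], Optional[str]]:
    if code.startswith("BUG"):
        return None, "SyntaxError: stray token"
    return float(code.split("-")[1]), None


best, history = rd_loop(toy_propose, toy_implement, toy_repair, toy_run)
```

Keeping the roles as plain callables separates the two kinds of feedback: `propose` only ever sees scores, and `repair` only ever sees error messages.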
Xu Yang
Microsoft Research Asia
Xiao Yang
Microsoft Research Asia
Shikai Fang
Microsoft Research Asia
Bowen Xian
Microsoft Research Asia
Yuante Li
Carnegie Mellon University
AI Scientist, Multi-Agent System, Large Language Models, Data Mining, AI For Finance
Jian Wang
Microsoft Research Asia
Minrui Xu
Nanyang Technological University
LLMs for Networks, Quantum Internet, Metaverse, Network Economics, DRL
Haoran Pan
Microsoft Research Asia
Xinpeng Hong
Microsoft Research Asia
Weiqing Liu
Microsoft Research Asia
Yelong Shen
Microsoft
NLP, Machine Learning
Weizhu Chen
Microsoft, Technical Fellow
Deep Learning, NLP, Natural Language Processing, Machine Learning
Jiang Bian
Microsoft Research Asia