R&D-Agent: Automating Data-Driven AI Solution Building Through LLM-Powered Automated Research, Development, and Evolution

📅 2025-05-20
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Current automated data science systems face performance bottlenecks and heavy reliance on domain expertise, with existing approaches struggling to balance efficiency and accuracy. To address this, we propose the first research–development dual-agent collaborative framework: a Researcher agent generates improvement strategies based on performance feedback, while a Developer agent iteratively refines code guided by error signals; the two agents coordinate dynamically via dual closed-loop interaction, enabling multi-path parallel exploration, dynamic trajectory fusion, and result aggregation. Built upon large language models (LLMs), the framework integrates feedback-driven code generation, correction, and search-enhanced optimization. Evaluated on the MLE-Bench benchmark, it achieves state-of-the-art performance and ranks first on the Machine Learning Engineering Agent Leaderboard. Open-sourced implementation demonstrates strong cross-task generalization and practical engineering applicability.

Technology Category

Application Category

📝 Abstract
Recent advances in AI and ML have transformed data science, yet increasing complexity and expertise requirements continue to hinder progress. While crowdsourcing platforms alleviate some challenges, high-level data science tasks remain labor-intensive and iterative. To overcome these limitations, we introduce R&D-Agent, a dual-agent framework for iterative exploration. The Researcher agent uses performance feedback to generate ideas, while the Developer agent refines code based on error feedback. By enabling multiple parallel exploration traces that merge and enhance one another, R&D-Agent narrows the gap between automated solutions and expert-level performance. Evaluated on MLE-Bench, R&D-Agent emerges as the top-performing machine learning engineering agent, demonstrating its potential to accelerate innovation and improve precision across diverse data science applications. We have open-sourced R&D-Agent on GitHub: https://github.com/microsoft/RD-Agent.
Problem

Research questions and friction points this paper is trying to address.

Automating complex AI solution development to reduce expertise barriers
Enhancing iterative data science tasks with dual-agent collaboration
Bridging performance gap between automated and expert-level ML solutions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dual-agent framework for iterative exploration
Researcher generates ideas via performance feedback
Developer refines code based on error feedback
X
Xu Yang
Microsoft Research Asia
X
Xiao Yang
Microsoft Research Asia
S
Shikai Fang
Microsoft Research Asia
B
Bowen Xian
Microsoft Research Asia
Yuante Li
Yuante Li
Carnegie Mellon University
AI ScientistMulti-Agent SystemLarge Language ModelsData MiningAI For Finance
J
Jian Wang
Microsoft Research Asia
Minrui Xu
Minrui Xu
Nanyang Technological University
LLMs for NetworksQuantum InternetMetaverseNetwork EconomicsDRL
H
Haoran Pan
Microsoft Research Asia
X
Xinpeng Hong
Microsoft Research Asia
W
Weiqing Liu
Microsoft Research Asia
Yelong Shen
Yelong Shen
Microsoft
NLPMachine Learning
Weizhu Chen
Weizhu Chen
Microsoft, Technical Fellow
Deep LearningNLPNatural Language Processingmachine learning
J
Jiang Bian
Microsoft Research Asia