Mimosa Framework: Toward Evolving Multi-Agent Systems for Scientific Research

📅 2026-03-30
📈 Citations: 0
✨ Influential: 0
📄 PDF
🤖 AI Summary
Existing autonomous scientific research systems are constrained by static workflows and fixed toolsets, limiting their adaptability to dynamic research tasks. This work proposes an evolvable, open multi-agent framework in which a meta-orchestrator dynamically constructs task-specific workflows, and code-generation agents execute subtasks by invoking a scientific tool library. The system iteratively refines its processes through LLM-based evaluation and feedback, enabling dynamic tool discovery, self-evolving workflows, and full auditability. Built upon the Model Context Protocol (MCP), the architecture supports flexible agent coordination. Evaluated on ScienceAgentBench using DeepSeek-V3.2, the framework achieves a 43.1% success rate—significantly outperforming both single-agent and static multi-agent baselines—and demonstrates strong cross-disciplinary generalization capabilities.
📝 Abstract
Current Autonomous Scientific Research (ASR) systems, despite leveraging large language models (LLMs) and agentic architectures, remain constrained by fixed workflows and toolsets that prevent adaptation to evolving tasks and environments. We introduce Mimosa, an evolving multi-agent framework that automatically synthesizes task-specific multi-agent workflows and iteratively refines them through experimental feedback. Mimosa leverages the Model Context Protocol (MCP) for dynamic tool discovery, generates workflow topologies via a meta-orchestrator, executes subtasks through code-generating agents that invoke available tools and scientific software libraries, and scores executions with an LLM-based judge whose feedback drives workflow refinement. On ScienceAgentBench, Mimosa achieves a success rate of 43.1% with DeepSeek-V3.2, surpassing both single-agent baselines and static multi-agent configurations. Our results further reveal that models respond heterogeneously to multi-agent decomposition and iterative learning, indicating that the benefits of workflow evolution depend on the capabilities of the underlying execution model. Beyond these benchmarks, Mimosa modular architecture and tool-agnostic design make it readily extensible, and its fully logged execution traces and archived workflows support auditability by preserving every analytical step for inspection and potential replication. Combined with domain-expert guidance, the framework has the potential to automate a broad range of computationally accessible scientific tasks across disciplines. Released as a fully open-source platform, Mimosa aims to provide an open foundation for community-driven ASR.
Problem

Research questions and friction points this paper is trying to address.

Autonomous Scientific Research
multi-agent systems
workflow adaptation
toolset flexibility
evolving tasks
Innovation

Methods, ideas, or system contributions that make the work stand out.

evolving multi-agent systems
autonomous scientific research
workflow synthesis
dynamic tool discovery
iterative refinement
M
Martin Legrand
Université Côte d’Azur, CNRS, ICN, Nice, France; Interdisciplinary Institute for Artificial Intelligence (3iA) Côte d’Azur, Sophia-Antipolis, France
T
Tao Jiang
Université Côte d’Azur, CNRS, ICN, Nice, France; Interdisciplinary Institute for Artificial Intelligence (3iA) Côte d’Azur, Sophia-Antipolis, France
M
Matthieu Feraud
Université Côte d’Azur, CNRS, ICN, Nice, France; Interdisciplinary Institute for Artificial Intelligence (3iA) Côte d’Azur, Sophia-Antipolis, France
B
Benjamin Navet
Université Côte d’Azur, CNRS, ICN, Nice, France; Interdisciplinary Institute for Artificial Intelligence (3iA) Côte d’Azur, Sophia-Antipolis, France; Inria, Université Côte d’Azur, CNRS, I3S, France
Yousouf Taghzouti
Yousouf Taghzouti
Univ. CĂ´te d'Azur, Inria, ICN, I3S, France
Knowledge RepresentationSemantic WebArtificial IntelligenceLLMContent Negotiation
Fabien Gandon
Fabien Gandon
INRIA
webartificial intelligenceknowledge graphs and linked datasemantic web and ontologyknowledge representation and reasonin
E
Elise Dumont
Université Côte d’Azur, CNRS, ICN, Nice, France
Louis-FĂŠlix Nothias
Louis-FĂŠlix Nothias
Chaire de Professeur Junior CNRS - UniversitĂŠ Cote d'Azur
MetabolomicsMass Spectrometry