🤖 AI Summary
Current LLM-based multi-agent system (MAS) research suffers from critical challenges: absence of shared codebases, methodological fragmentation, poor reproducibility, and inconsistent evaluation. To address these, we introduce the first unified, open-source framework specifically designed for LLM-MAS research. It systematically integrates over 20 state-of-the-art methods within a modular Python architecture, supporting multi-model API interoperability, pluggable communication protocols, structured task orchestration, and a standardized cross-benchmark evaluation pipeline. The framework covers 10+ established benchmarks and 8 categories of LLMs; all integrated methods undergo rigorous consistency verification against their official implementations. This significantly reduces development and evaluation overhead for novel MAS approaches, thereby advancing standardization, reproducibility, and collaborative progress in LLM-MAS research.
📝 Abstract
LLM-based multi-agent systems (MAS) have demonstrated significant potential in enhancing single LLMs to address complex and diverse tasks in practical applications. Despite considerable advancements, the field lacks a unified codebase that consolidates existing methods, resulting in redundant re-implementation efforts, unfair comparisons, and high entry barriers for researchers. To address these challenges, we introduce MASLab, a unified, comprehensive, and research-friendly codebase for LLM-based MAS. (1) MASLab integrates over 20 established methods across multiple domains, each rigorously validated by comparing step-by-step outputs with its official implementation. (2) MASLab provides a unified environment with various benchmarks for fair comparisons among methods, ensuring consistent inputs and standardized evaluation protocols. (3) MASLab implements methods within a shared streamlined structure, lowering the barriers for understanding and extension. Building on MASLab, we conduct extensive experiments covering 10+ benchmarks and 8 models, offering researchers a clear and comprehensive view of the current landscape of MAS methods. MASLab will continue to evolve, tracking the latest developments in the field, and invite contributions from the broader open-source community.