MASLab: A Unified and Comprehensive Codebase for LLM-based Multi-Agent Systems

📅 2025-05-22
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Current LLM-based multi-agent system (MAS) research suffers from critical challenges: absence of shared codebases, methodological fragmentation, poor reproducibility, and inconsistent evaluation. To address these, we introduce the first unified, open-source framework specifically designed for LLM-MAS research. It systematically integrates over 20 state-of-the-art methods within a modular Python architecture, supporting multi-model API interoperability, pluggable communication protocols, structured task orchestration, and a standardized cross-benchmark evaluation pipeline. The framework covers 10+ established benchmarks and 8 categories of LLMs; all integrated methods undergo rigorous consistency verification against their official implementations. This significantly reduces development and evaluation overhead for novel MAS approaches, thereby advancing standardization, reproducibility, and collaborative progress in LLM-MAS research.

Technology Category

Application Category

📝 Abstract
LLM-based multi-agent systems (MAS) have demonstrated significant potential in enhancing single LLMs to address complex and diverse tasks in practical applications. Despite considerable advancements, the field lacks a unified codebase that consolidates existing methods, resulting in redundant re-implementation efforts, unfair comparisons, and high entry barriers for researchers. To address these challenges, we introduce MASLab, a unified, comprehensive, and research-friendly codebase for LLM-based MAS. (1) MASLab integrates over 20 established methods across multiple domains, each rigorously validated by comparing step-by-step outputs with its official implementation. (2) MASLab provides a unified environment with various benchmarks for fair comparisons among methods, ensuring consistent inputs and standardized evaluation protocols. (3) MASLab implements methods within a shared streamlined structure, lowering the barriers for understanding and extension. Building on MASLab, we conduct extensive experiments covering 10+ benchmarks and 8 models, offering researchers a clear and comprehensive view of the current landscape of MAS methods. MASLab will continue to evolve, tracking the latest developments in the field, and invite contributions from the broader open-source community.
Problem

Research questions and friction points this paper is trying to address.

Lack of unified codebase for LLM-based multi-agent systems
Redundant re-implementation and unfair method comparisons
High entry barriers for MAS researchers
Innovation

Methods, ideas, or system contributions that make the work stand out.

Integrates 20+ validated methods across domains
Provides unified benchmarks for fair comparisons
Shared streamlined structure lowers entry barriers
🔎 Similar Papers
No similar papers found.
R
Rui Ye
Shanghai Jiao Tong University
K
Keduan Huang
Shanghai Jiao Tong University
Q
Qimin Wu
Shanghai Jiao Tong University
Y
Yuzhu Cai
Shanghai Jiao Tong University
T
Tian Jin
Shanghai Jiao Tong University
Xianghe Pang
Xianghe Pang
Shanghai Jiao Tong University
LLM Agent
X
Xiangrui Liu
Shanghai Jiao Tong University
Jiaqi Su
Jiaqi Su
Adobe Research
audiospeechdeep learning
C
Chen Qian
Shanghai Jiao Tong University
Bohan Tang
Bohan Tang
DPhil Student, University of Oxofrd
LLM AgentMachine LearningDeep Learning
K
Kaiqu Liang
Princeton University
Jiaao Chen
Jiaao Chen
Meta
Machine LearningNatural Language Processing
Y
Yue Hu
University of Michigan
Zhenfei Yin
Zhenfei Yin
University of Oxford
Deep LearningMultimodalAI AgentRobotics
Rongye Shi
Rongye Shi
Beihang University
multi-agent reinforcement learningdeep learningintelligent transportation systems
B
Bo An
Nanyang Technological University
Y
Yang Gao
Nanjing University
W
Wenjun Wu
Beihang University
Lei Bai
Lei Bai
Shanghai AI Laboratory
Foundation ModelScience IntelligenceMulti-Agent SystemAutonomous Discovery
Siheng Chen
Siheng Chen
Shanghai Jiao Tong University
Collective intelligenceLLM agentgraph signal processingcollaborative perception