360∘REA: Towards A Reusable Experience Accumulation with 360∘ Assessment for Multi-Agent System

📅 2024-04-08
🏛️ Findings of the Association for Computational Linguistics ACL 2024
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Multi-agent systems face performance bottlenecks in complex tasks due to coarse-grained evaluation and poor reusability of learned experience. Method: We propose an evaluation-driven, reusable experience accumulation framework featuring a novel 360° fine-grained multi-dimensional evaluation mechanism, a hierarchical multi-agent architecture, a dual-level experience pool (with structured storage and semantic retrieval), and an LLM-powered self-reflection and experience distillation module—enabling a closed “evaluation–feedback–evolution” loop. Contribution/Results: This work pioneers the integration of an organized evaluation paradigm into multi-LLM collaborative systems, significantly enhancing long-term team performance and cross-task generalization. Extensive experiments across multiple complex task benchmarks demonstrate consistent and substantial improvements over state-of-the-art baselines, validating both the effectiveness and scalability of evaluation-driven experience accumulation.

Technology Category

Application Category

📝 Abstract
Large language model agents have demonstrated remarkable advancements across various complex tasks. Recent works focus on optimizing the agent team or employing self-reflection to iteratively solve complex tasks. Since these agents are all based on the same LLM, only conducting self-evaluation or removing underperforming agents does not substantively enhance the capability of the agents. We argue that a comprehensive evaluation and accumulating experience from evaluation feedback is an effective approach to improving system performance. In this paper, we propose Reusable Experience Accumulation with 360$^circ$ Assessment (360$^circ$REA), a hierarchical multi-agent framework inspired by corporate organizational practices. The framework employs a novel 360$^circ$ performance assessment method for multi-perspective performance evaluation with fine-grained assessment. To enhance the capability of agents in addressing complex tasks, we introduce dual-level experience pool for agents to accumulate experience through fine-grained assessment. Extensive experiments on complex task datasets demonstrate the effectiveness of 360$^circ$REA.
Problem

Research questions and friction points this paper is trying to address.

Enhance multi-agent system performance through comprehensive evaluation.
Accumulate reusable experience using fine-grained assessment feedback.
Improve agent capability in complex tasks with hierarchical framework.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hierarchical multi-agent framework for complex tasks
360-degree performance assessment for multi-perspective evaluation
Dual-level experience pool for agent capability enhancement
🔎 Similar Papers
No similar papers found.
S
Shen Gao
University of Electronic Science and Technology of China
H
Hao Li
Shandong University
Chengrui Huang
Chengrui Huang
University of Electronic Science and Technology of China
Natural Language ProcessingTool Learning
Quan Tu
Quan Tu
Renmin University of China
Dialogue SystemText GenerationInformation Retrieval
Z
Zhiliang Tian
National University of Defense Technology
M
Minlie Huang
Tsinghua University
Shuo Shang
Shuo Shang
Computer Science & AI Scientist
Spatial dataSpatiotemporal databases