EvoConfig: Self-Evolving Multi-Agent Systems for Efficient Autonomous Environment Configuration

📅 2026-01-23
📈 Citations: 1
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of configuration failures in large-scale software environments, which often stem from complex errors and are inadequately handled by existing approaches due to their lack of fine-grained analysis of agent behaviors and effective repair mechanisms. To overcome this limitation, we propose EvoConfig, a novel framework that introduces a self-evolving multi-agent collaboration mechanism integrated with a fine-grained expert diagnosis module. This enables precise post-execution error identification and the generation of targeted repair strategies, further enhanced by dynamic priority adjustment to optimize the repair process. Evaluated on the EnvBench benchmark, EvoConfig achieves a success rate of 78.1%, outperforming the current state-of-the-art method, Repo2Run, by 7.1%, and demonstrates significant improvements in both error identification accuracy and the effectiveness of repair recommendations.

Technology Category

Application Category

📝 Abstract
A reliable executable environment is the foundation for ensuring that large language models solve software engineering tasks. Due to the complex and tedious construction process, large-scale configuration is relatively inefficient. However, most methods always overlook fine-grained analysis of the actions performed by the agent, making it difficult to handle complex errors and resulting in configuration failures. To address this bottleneck, we propose EvoConfig, an efficient environment configuration framework that optimizes multi-agent collaboration to build correct runtime environments. EvoConfig features an expert diagnosis module for fine-grained post-execution analysis, and a self-evolving mechanism that lets expert agents self-feedback and dynamically adjust error-fixing priorities in real time. Empirically, EvoConfig matches the previous state-of-the-art Repo2Run on Repo2Run's 420 repositories, while delivering clear gains on harder cases: on the more challenging Envbench, EvoConfig achieves a 78.1% success rate, outperforming Repo2Run by 7.1%. Beyond end-to-end success, EvoConfig also demonstrates stronger debugging competence, achieving higher accuracy in error identification and producing more effective repair recommendations than existing methods.
Problem

Research questions and friction points this paper is trying to address.

environment configuration
multi-agent systems
large language models
error handling
autonomous software engineering
Innovation

Methods, ideas, or system contributions that make the work stand out.

Self-Evolving Multi-Agent Systems
Fine-Grained Post-Execution Analysis
Dynamic Error-Fixing Prioritization
Autonomous Environment Configuration
Expert Diagnosis Module
🔎 Similar Papers
No similar papers found.
X
Xinshuai Guo
Tsinghua University
J
Jiayi Kuang
Sun-Yat Sen University
L
Linyue Pan
Tsinghua University
Y
Yinghui Li
Tencent Youtu Lab
H
Hai-Tao Zheng
Tsinghua University
Y
Ying Shen
Sun-Yat Sen University
Di Yin
Di Yin
Tencent
LLMNLPMLLM
Xing Sun
Xing Sun
Tencent Youtu Lab
LLMMLLMAgent