🤖 AI Summary
Current automated multi-agent design methods suffer from poor scenario generalization, limited tool integration, rigid communication architectures, and heavy reliance on external training data. To address these limitations, we propose a finite state machine (FSM)-based automated framework construction method that enables end-to-end system generation without training data, leveraging programmable state transitions and action control. The approach integrates large language models (LLMs) for task understanding and agent role orchestration, supporting dynamic tool invocation and adaptive communication topology reconfiguration. Experimental evaluation across multiple open-ended tasks demonstrates that systems generated by our method significantly outperform mainstream automated design baselines and achieve collaborative performance on par with manually fine-tuned systems. To the best of our knowledge, this is the first work to unify high flexibility, strong generalization, and zero-shot deployability in automated multi-agent system design.
📝 Abstract
Large Language Models (LLMs) have demonstrated the ability to solve a wide range of practical tasks within multi-agent systems. However, existing human-designed multi-agent frameworks are typically limited to a small set of pre-defined scenarios, while current automated design methods suffer from several limitations, such as the lack of tool integration, dependence on external training data, and rigid communication structures. In this paper, we propose MetaAgent, a finite state machine based framework that can automatically generate a multi-agent system. Given a task description, MetaAgent will design a multi-agent system and polish it through an optimization algorithm. When the multi-agent system is deployed, the finite state machine will control the agent's actions and the state transitions. To evaluate our framework, we conduct experiments on both text-based tasks and practical tasks. The results indicate that the generated multi-agent system surpasses other auto-designed methods and can achieve a comparable performance with the human-designed multi-agent system, which is optimized for those specific tasks.