Agent-GWO: Collaborative Agents for Dynamic Prompt Optimization in Large Language Models

📅 2026-04-14
📈 Citations: 0
Influential: 0
📄 PDF

career value

201K/year
🤖 AI Summary
This work addresses the limitations of current large language models, whose reasoning performance is constrained by handcrafted static prompts and sensitivity to decoding configurations and task distributions, leading to insufficient generalization and stability. Existing automatic prompt optimization approaches predominantly rely on single-agent local search, making it difficult to jointly optimize prompts and hyperparameters. To overcome this, the paper proposes Agent-GWO, a novel framework that introduces swarm intelligence—specifically, the Grey Wolf Optimizer (GWO)—to this domain for the first time. By leveraging the collaborative guidance of α, β, and δ leader agents in GWO, the method unifies prompt templates and decoding hyperparameters into an evolvable agent configuration, enabling dynamic, global joint optimization within a single framework. Experiments demonstrate consistent and significant improvements in accuracy and robustness across multiple mathematical and mixed-reasoning benchmarks, outperforming existing prompt optimization techniques across diverse large language model backbones.

Technology Category

Application Category

📝 Abstract
Large Language Models (LLMs) have demonstrated strong capabilities in complex reasoning tasks, while recent prompting strategies such as Chain-of-Thought (CoT) have further elevated their performance in handling complex logical problems. Despite these advances, high-quality reasoning remains heavily reliant on manual static prompts and is sensitive to decoding configurations and task distributions, leading to performance fluctuations and limited transferability. Existing automatic prompt optimization methods typically adopt single-agent local search, failing to simultaneously optimize prompts and decoding hyperparameters within a unified framework to achieve stable global improvements. To address this limitation, we propose Agent-GWO, a dynamic prompt optimization framework for complex reasoning. Specifically, we unify prompt templates and decoding hyperparameters as inheritable agent configurations. By leveraging the leader-follower mechanism of the Grey Wolf Optimizer (GWO), we automatically select three leader agents ($α$, $β$, and $δ$) to guide the collaborative updates of the remaining agents, enabling iterative convergence toward robust optimal reasoning configurations that can be seamlessly integrated for inference. Extensive experiments on multiple mathematical and hybrid reasoning benchmarks across diverse LLM backbones show that Agent-GWO consistently improves accuracy and stability over existing prompt optimization methods. The code will be released publicly.
Problem

Research questions and friction points this paper is trying to address.

prompt optimization
large language models
decoding hyperparameters
reasoning stability
transferability
Innovation

Methods, ideas, or system contributions that make the work stand out.

dynamic prompt optimization
multi-agent collaboration
Grey Wolf Optimizer
decoding hyperparameter tuning
large language models
X
Xudong Wang
Kyung Hee University
Chaoning Zhang
Chaoning Zhang
Professor at UESTC (电子科技大学, China)
Computer VisionLLM and VLMGenAI and AIGC Detection
Chenghao Li
Chenghao Li
PhD Candidate, Japan Advanced Institute of Science and Technology
RoboticsGraspingHuman-Robot InteractionAI SecurityComputer Vision
S
Shuxu Chen
Kyung Hee University
Q
Qigan Sun
Kyung Hee University
J
Jiaquan Zhang
University of Electronic Science and Technology of China
F
Fachrina Dewi Puspitasari
University of Electronic Science and Technology of China
Tae-Ho Kim
Tae-Ho Kim
Nota Inc.
Jiwei Wei
Jiwei Wei
Professor at University of Electronic Science and Technology of China (UESTC)
Cross-Modal RetrievalMetric LearningAdversarial Machine LearningAIGC
M
Malu Zhang
University of Electronic Science and Technology of China
Guoqing Wang
Guoqing Wang
University of Electronic Science and Technology of China
Computer VisionMachine LearningPattern RecognitionIntelligent System
Y
Yang Yang
University of Electronic Science and Technology of China
H
Heng Tao Shen
Tongji University