COAgents: Multi-Agent Framework to Learn and Navigate Routing Problems Search Space

📅 2026-05-19
📈 Citations: 0
Influential: 0
📄 PDF

career value

224K/year
🤖 AI Summary
This work addresses the challenge of solving large-scale Vehicle Routing Problems (VRP), which are notoriously difficult due to their combinatorial complexity and the limited generalization capability of traditional heuristics. The authors propose a multi-agent graph search framework that models the search process as a dynamically constructed partial search graph, guided collaboratively by three specialized agents responsible for node selection, move selection, and strategic jumping. By decoupling problem-agnostic search control from domain-specific encodings, the framework significantly enhances cross-instance adaptability and exploration efficiency. Evaluated on standard VRPTW benchmarks, the method achieves a new state-of-the-art among learning-based approaches, reducing the optimality gap by 14%–44% compared to POMO and by 21%–40% relative to ALNS.
📝 Abstract
Although Vehicle Routing Problems (VRP) are essential to many real-world systems, they remain computationally intractable at scale due to their combinatorial complexity. Traditional heuristics rely on handcrafted rules for local improvements and occasional \textit{jumps} to escape local minima, but often struggle to generalize across diverse instances. We introduce \textbf{COAgents}, a cooperative multi-agent framework that models the search process as a graph: nodes represent solutions, and edges correspond to either local refinements or large perturbations for diversification (i.e., jumps). A \textit{Partial Search Graph} (PSG) is dynamically constructed during search, enabling COAgents to train a Node Selection Agent and a Move Selection Agent to guide intensification, and a Jump Agent to trigger well-timed explorations of new regions. Unlike end-to-end learning approaches, COAgents cleanly separates problem-agnostic search control from compact domain-specific encoding, facilitating adaptability across tasks. Extensive experiments on the CVRP and VRPTW benchmarks show that COAgents remains competitive with several learn-to-search baselines on CVRP and sets a new state of the art among learning-based methods on the more challenging VRPTW instances, reducing the gap to the best-known solutions by 14\% at $N\!=\!100$ and 44\% at $N\!=\!50$ relative to the strongest neural solver (POMO), and by 21\% and 40\% respectively relative to ALNS. Code is available at https://github.com/mahdims/COAgents.
Problem

Research questions and friction points this paper is trying to address.

Vehicle Routing Problem
Combinatorial Optimization
Search Space Navigation
Generalization
Local Minima Escape
Innovation

Methods, ideas, or system contributions that make the work stand out.

multi-agent framework
search space navigation
vehicle routing problem
partial search graph
learn-to-search
🔎 Similar Papers
No similar papers found.
O
Oleksandr Yakovenko
Huawei Technologies Canada, 4321 Still Creek Dr, Burnaby, BC, Canada V5C 6S7
M
Mahdi Mostajabdaveh
Huawei Technologies Canada, 4321 Still Creek Dr, Burnaby, BC, Canada V5C 6S7
C
Cheikh Ahmed
Huawei Technologies Canada, 4321 Still Creek Dr, Burnaby, BC, Canada V5C 6S7
A
Abdullah Ali Sivas
Huawei Technologies Canada, 4321 Still Creek Dr, Burnaby, BC, Canada V5C 6S7
X
Xiaorui Li
Huawei Technologies Canada, 4321 Still Creek Dr, Burnaby, BC, Canada V5C 6S7
Zirui Zhou
Zirui Zhou
Huawei Technologies Canada
Mathematical OptimizationDesign and Analysis of AlgorithmsMachine Learning
M
Mao Kun
Huawei Technologies, China