Learning to Insert for Constructive Neural Vehicle Routing Solver

πŸ“… 2025-05-20
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Existing learning-based constructive approaches for the Vehicle Routing Problem (VRP) suffer from suboptimal solution quality due to their reliance on fixed-order node insertion. To address this, we propose the first neural constructive solver based on a dynamic insertion paradigm: instead of appending unvisited nodes in a predetermined sequence, our method flexibly inserts each node into any feasible position within the current partial solution. We design a dedicated insertion-position prediction network that integrates self-attention with sequential modeling, and train it via reinforcement learning. During inference, we combine greedy decoding with post-hoc local search to further improve solution quality. Evaluated on both synthetic and real-world instances of the Traveling Salesman Problem (TSP) and Capacitated VRP (CVRP), our approach consistently outperforms state-of-the-art constructive neural combinatorial optimization (NCO) methods, achieving superior solution accuracy and significantly enhanced generalization across problem scales.

Technology Category

Application Category

πŸ“ Abstract
Neural Combinatorial Optimisation (NCO) is a promising learning-based approach for solving Vehicle Routing Problems (VRPs) without extensive manual design. While existing constructive NCO methods typically follow an appending-based paradigm that sequentially adds unvisited nodes to partial solutions, this rigid approach often leads to suboptimal results. To overcome this limitation, we explore the idea of insertion-based paradigm and propose Learning to Construct with Insertion-based Paradigm (L2C-Insert), a novel learning-based method for constructive NCO. Unlike traditional approaches, L2C-Insert builds solutions by strategically inserting unvisited nodes at any valid position in the current partial solution, which can significantly enhance the flexibility and solution quality. The proposed framework introduces three key components: a novel model architecture for precise insertion position prediction, an efficient training scheme for model optimization, and an advanced inference technique that fully exploits the insertion paradigm's flexibility. Extensive experiments on both synthetic and real-world instances of the Travelling Salesman Problem (TSP) and Capacitated Vehicle Routing Problem (CVRP) demonstrate that L2C-Insert consistently achieves superior performance across various problem sizes.
Problem

Research questions and friction points this paper is trying to address.

Improving flexibility in constructive NCO for VRPs
Overcoming suboptimal results from appending-based methods
Enhancing solution quality via strategic node insertion
Innovation

Methods, ideas, or system contributions that make the work stand out.

Insertion-based paradigm enhances solution flexibility
Novel model predicts precise insertion positions
Advanced inference exploits insertion flexibility fully
Fu Luo
Fu Luo
Southern University of Science and Technology
Neural combinatorial optimization
X
Xi Lin
Department of Computer Science, City University of Hong Kong, Hong Kong SAR, China
M
Mengyuan Zhong
School of Mathematics and Statistics, Xi’an Jiaotong University, Xi’an, China
F
Fei Liu
Department of Computer Science, City University of Hong Kong, Hong Kong SAR, China
Z
Zhenkun Wang
School of Automation and Intelligent Manufacturing & Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen, China
Jianyong Sun
Jianyong Sun
School of Mathematics and Statistics, Xi'an Jiaotong University, China
evolutionary computationstatistical machine learning
Qingfu Zhang
Qingfu Zhang
Chair Professor, FIEEE, City University of Hong Kong
evolutionary computationmultiobjective optimizationcomputational intelligence