An End-to-End Learning Approach for Solving Capacitated Location-Routing Problems

๐Ÿ“… 2025-11-04
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This paper addresses the Capacitated Location-Routing Problem (CLRP) and its open variant (OCLRP), which jointly optimize facility location and vehicle routing under stringent capacity constraints, inducing strong decision interdependence. To tackle this challenge, we formulate CLRP as a multi-stage Markov Decision Process (MDP) for the first time and propose an end-to-end deep reinforcement learning (DRL) framework. Built upon an encoder-decoder architecture, our method introduces a heterogeneous query attention mechanism that dynamically adapts to the semantic requirements of distinct decision stagesโ€”location selection and route construction. This design unifies the modeling of location-routing coordination and enables scalable DRL-based optimization. Extensive experiments on synthetic and benchmark datasets demonstrate that our approach significantly outperforms classical heuristics and state-of-the-art DRL baselines in both solution quality and generalization capability.

Technology Category

Application Category

๐Ÿ“ Abstract
The capacitated location-routing problems (CLRPs) are classical problems in combinatorial optimization, which require simultaneously making location and routing decisions. In CLRPs, the complex constraints and the intricate relationships between various decisions make the problem challenging to solve. With the emergence of deep reinforcement learning (DRL), it has been extensively applied to address the vehicle routing problem and its variants, while the research related to CLRPs still needs to be explored. In this paper, we propose the DRL with heterogeneous query (DRLHQ) to solve CLRP and open CLRP (OCLRP), respectively. We are the first to propose an end-to-end learning approach for CLRPs, following the encoder-decoder structure. In particular, we reformulate the CLRPs as a markov decision process tailored to various decisions, a general modeling framework that can be adapted to other DRL-based methods. To better handle the interdependency across location and routing decisions, we also introduce a novel heterogeneous querying attention mechanism designed to adapt dynamically to various decision-making stages. Experimental results on both synthetic and benchmark datasets demonstrate superior solution quality and better generalization performance of our proposed approach over representative traditional and DRL-based baselines in solving both CLRP and OCLRP.
Problem

Research questions and friction points this paper is trying to address.

Solving capacitated location-routing problems with complex constraints
Developing end-to-end learning approach for location and routing decisions
Handling interdependency between facility location and vehicle routing
Innovation

Methods, ideas, or system contributions that make the work stand out.

End-to-end deep reinforcement learning approach
Markov decision process for decision modeling
Heterogeneous querying attention mechanism
๐Ÿ”Ž Similar Papers
2024-08-30Transportation Research Part E: Logistics and Transportation ReviewCitations: 0
Changhao Miao
Changhao Miao
Beijing Institute of Technology
Machine LearningOptimization
Y
Yuntian Zhang
National Key Lab of Autonomous Intelligent Unmanned Systems, Beijing Institute of Technology, Beijing 100081, China
T
Tongyu Wu
National Key Lab of Autonomous Intelligent Unmanned Systems, Beijing Institute of Technology, Beijing 100081, China
Fang Deng
Fang Deng
Beijing Institute of Technology
New EnergyIntelligent Information ProcessingIntelligent Wearable System
C
Chen Chen
National Key Lab of Autonomous Intelligent Unmanned Systems, Beijing Institute of Technology, Beijing 100081, China