2022
DOI: 10.1007/978-3-031-08011-1_14
|View full text |Cite
|
Sign up to set email alerts
|

Deep Policy Dynamic Programming for Vehicle Routing Problems

Abstract: Routing problems are a class of combinatorial problems with many practical applications. Recently, end-to-end deep learning methods have been proposed to learn approximate solution heuristics for such problems. In contrast, classical dynamic programming (DP) algorithms guarantee optimal solutions, but scale badly with the problem size. We propose Deep Policy Dynamic Programming (DPDP), which aims to combine the strengths of learned neural heuristics with those of DP algorithms. DPDP prioritizes and restricts t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
257
2
1

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 112 publications
(262 citation statements)
references
References 54 publications
2
257
2
1
Order By: Relevance
“…Underpinned by enhancements in hardware and artificial intelligence research over the last years, the development of deep NNs made them relevant to a wide range of difficult combinatorial optimization problems, such as SAT, Minimum Vertex Cover, and Maximum Cut [142,143]. When applied to solve CVRPs, these networks are usually combined with reinforcement learning (RL [144, 145]) or typically used for node classification or edge prediction [146,147]. Despite extensive research, GNNs for directly solving CVRPs remain limited to small problem instances with up to 100 customers and generally do not compare favorably with classic optimization methods (exact or heuristic) in terms of solution quality.…”
Section: Discussionmentioning
confidence: 99%
See 4 more Smart Citations
“…Underpinned by enhancements in hardware and artificial intelligence research over the last years, the development of deep NNs made them relevant to a wide range of difficult combinatorial optimization problems, such as SAT, Minimum Vertex Cover, and Maximum Cut [142,143]. When applied to solve CVRPs, these networks are usually combined with reinforcement learning (RL [144, 145]) or typically used for node classification or edge prediction [146,147]. Despite extensive research, GNNs for directly solving CVRPs remain limited to small problem instances with up to 100 customers and generally do not compare favorably with classic optimization methods (exact or heuristic) in terms of solution quality.…”
Section: Discussionmentioning
confidence: 99%
“…In this study, in particular, we aim to learn and use relatedness information in the LS and crossover operators of HGS to improve this stateof-the-art method substantially. We capitalize upon the work of [146], which trained a GNN to predict occurrence probabilities of edges in high-quality solutions (i.e., heatmap), and used this information to sparsify the underlying graph and accelerate related solution procedures. Instead, we leverage the heatmaps as a source of relatedness information to define neighborhood restrictions in the LS and possible re-connection points in the crossover.…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations