Deep Policy Dynamic Programming for Vehicle Routing Problems

Kool, Wouter; Hoof, Herke van; Gromicho, Joaquim; Welling, Max

doi:10.1007/978-3-031-08011-1_14

Cited by 112 publications

(262 citation statements)

References 54 publications

Supporting

Mentioning

257

Contrasting

Unclassified


“…Underpinned by enhancements in hardware and artificial intelligence research over the last years, the development of deep NNs made them relevant to a wide range of difficult combinatorial optimization problems, such as SAT, Minimum Vertex Cover, and Maximum Cut [142,143]. When applied to solve CVRPs, these networks are usually combined with reinforcement learning (RL [144, 145]) or typically used for node classification or edge prediction [146,147]. Despite extensive research, GNNs for directly solving CVRPs remain limited to small problem instances with up to 100 customers and generally do not compare favorably with classic optimization methods (exact or heuristic) in terms of solution quality.…”

Section: Discussion (mentioning)

confidence: 99%

“…In this study, in particular, we aim to learn and use relatedness information in the LS and crossover operators of HGS to improve this state-of-the-art method substantially. We capitalize upon the work of [146], which trained a GNN to predict occurrence probabilities of edges in high-quality solutions (i.e., heatmap), and used this information to sparsify the underlying graph and accelerate related solution procedures. Instead, we leverage the heatmaps as a source of relatedness information to define neighborhood restrictions in the LS and possible re-connection points in the crossover.…”

Section: Discussion (mentioning)

confidence: 99%

“…This definition is general: in the simplest setting, relatedness could be the inverse of distance, i.e., ϕ(i, j) = 1/d_ij. In a more informed setting, we can instead consider defining ϕ(i, j) as the output of a graph neural network (GNN) as seen in [146], predicting the probability of occurrence of an edge in a high-quality solution. Probabilities of this kind are typically called heatmaps.…”

Section: Methods (mentioning)

confidence: 99%
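
The simplest relatedness measure quoted above, ϕ(i, j) = 1/d_ij, can be illustrated with a minimal sketch. The function names `inverse_distance_relatedness` and `most_related` and the sample coordinates are hypothetical, chosen for illustration only; they are not taken from the cited work. The sketch assumes Euclidean distances and distinct customer coordinates (coincident points would divide by zero).

```python
import math


def inverse_distance_relatedness(coords):
    """Return a matrix phi with phi[i][j] = 1 / d_ij for i != j.

    Assumes Euclidean distance and pairwise-distinct coordinates.
    The diagonal is left at 0.0 because a node has no edge to itself.
    """
    n = len(coords)
    phi = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                phi[i][j] = 1.0 / math.dist(coords[i], coords[j])
    return phi


def most_related(phi, i, k):
    """Return the k nodes most related to node i, most related first."""
    others = [j for j in range(len(phi)) if j != i]
    return sorted(others, key=lambda j: phi[i][j], reverse=True)[:k]


# Illustrative data only: three points in the plane.
coords = [(0.0, 0.0), (3.0, 4.0), (0.0, 1.0)]
phi = inverse_distance_relatedness(coords)
neighbours_of_0 = most_related(phi, 0, 1)
```

A restricted neighborhood of this kind could then limit which moves a local search considers for each customer; swapping in GNN edge probabilities for `phi` would give the more informed setting the quote describes.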

“…In light of this, an emerging line of research embraces neural networks (NN) as (sub)routines to solve CO problems. For example, [146] presented a graph NN that predicts the probability of each edge participating in a high-quality solution. All edges whose probabilities are below a threshold are then used to prune the search space of solutions in applications for the capacitated vehicle routing problem (CVRP) and the traveling salesperson problem (TSP).…”

Section: List Of Tables (mentioning)

confidence: 99%
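
The thresholding step described in the statement above can be sketched as follows. This is only an illustration under stated assumptions: `sparsify_by_heatmap` is a hypothetical helper, the heatmap values are made up rather than produced by any trained network, and the threshold of 0.1 is arbitrary.

```python
def sparsify_by_heatmap(heatmap, threshold):
    """Keep only directed edges whose predicted probability meets the threshold.

    heatmap[i][j] is assumed to hold the predicted probability that edge
    (i, j) appears in a high-quality solution. Edges below the threshold
    are dropped, which shrinks the search space.
    """
    n = len(heatmap)
    return {
        (i, j)
        for i in range(n)
        for j in range(n)
        if i != j and heatmap[i][j] >= threshold
    }


# Made-up probabilities for a 3-node graph (illustrative only).
heatmap = [
    [0.0, 0.9, 0.05],
    [0.8, 0.0, 0.3],
    [0.02, 0.4, 0.0],
]
kept_edges = sparsify_by_heatmap(heatmap, threshold=0.1)
```

A search procedure would then only consider transitions along `kept_edges`; a lower threshold keeps more edges at the cost of a larger search space.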

“…All experiments are run on a single thread of an Intel Gold 6148 Skylake 2.4 GHz processor with 40 GB of RAM and NVIDIA Tesla P100 Pascal (12 G memory), running CentOS 7.8.2003. Unless otherwise stated, we use the original parameters defined for HGS in [136] and the GNN in [146]. To achieve fast convergence, we set smaller values for the population-size parameters in HGS: µ = 12 and λ = 20.…”

Section: Computational Environment (mentioning)

confidence: 99%


Exploring the Frontier of Combinatorial Optimization and Machine Learning: Applications to Vehicle Routing and Support Vector Machines

SANTANA¹


I would like to thank my family: my parents, Mariza, Jaci, and Emilio, and my girlfriend, Daniella, and her family. Thanks for all the fantastic support, understanding, and inspiration that helped keep me focused on my research. I would also like to express my deep gratitude to my advisor Thibaut Vidal for all the outstanding support, lessons, guidance, car rides to university, funny moments, and friendship. I learned so much from his lectures, conversations, attitude, and example.


Iterative sampling of expensive simulations for faster deep surrogate training*

Gaffney, Humbird, Kruse, et al. 2023

Contributions to Plasma Physics


Deep neural network (DNN) surrogates of expensive physics simulations are enabling a rapid change in the way that common experimental design and analysis tasks are approached. Surrogate models allow simulations to be performed in parallel and separately from downstream tasks, thereby enabling analyses that would be impossible with the simulation in‐the‐loop; surrogates based on DNNs can effectively emulate diverse non‐scalar data of the types collected in fusion and laboratory‐astrophysics experiments. The challenge is in training the surrogate model, for which large ensembles of physics simulations must be run, preferably without wasting computational effort on uninteresting simulations. In this paper, we present an iterative sampling scheme that can preferentially propose simulations in interesting regions of parameter space without neglecting unexplored regions, allowing high‐quality and wide‐ranging surrogate models to be trained using 2–3 times fewer simulations compared to space‐filling designs. Our approach uses an explicit importance function defined on the simulation output space, balanced against a measure of simulation density which serves as a proxy for surrogate accuracy. It is easy to implement and can be tuned to find interesting simulations early in the study, allowing surrogates to be trained quickly and refined as new simulations become available; this represents an important step towards the routine generation of deep surrogate models quickly enough to be truly relevant to experimental work.


Learning to repeatedly solve routing problems

Morabit, Desaulniers, Lodi 2023

Networks


In recent years, there has been great interest in machine‐learning‐based heuristics for solving NP‐hard combinatorial optimization problems. The developed methods have shown potential on many optimization problems. In this paper, we present a learned heuristic for the reoptimization of a problem after a minor change in its data. We focus on the case of the capacitated vehicle routing problem with static clients (i.e., same client locations) and changed demands. Given the edges of an original solution, the goal is to predict and fix the ones that have a high chance of remaining in an optimal solution after a change of client demands. This partial prediction of the solution reduces the complexity of the problem and speeds up its resolution, while yielding a good quality solution. The proposed approach resulted in solutions with an optimality gap ranging from 0% to 1.7% on different benchmark instances within a reasonable computing time.
