2020
DOI: 10.48550/arxiv.2003.03600
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Reinforcement Learning for Combinatorial Optimization: A Survey

Abstract: Combinatorial optimization (CO) is the workhorse of numerous important applications in operations research, engineering and other fields and, thus, has been attracting enormous attention from the research community for over a century. Many efficient solutions to common problems involve using hand-crafted heuristics to sequentially construct a solution. Therefore, it is intriguing to see how a combinatorial optimization problem can be formulated as a sequential decision making process and whether efficient heur… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
18
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
6
3
1

Relationship

0
10

Authors

Journals

citations
Cited by 20 publications
(18 citation statements)
references
References 4 publications
0
18
0
Order By: Relevance
“…Related to our approach, Cappart et al (2020) propose to combine reinforcement learning, constraint programming and dynamic programming and experiment with the TSP with time windows. For surveys of machine learning for routing problems and combinatorial optimization in general, we refer to Mazyavkina et al (2020); Vesselinova et al (2020).…”
Section: Machine Learning For Vehicle Routing Problemsmentioning
confidence: 99%
“…Related to our approach, Cappart et al (2020) propose to combine reinforcement learning, constraint programming and dynamic programming and experiment with the TSP with time windows. For surveys of machine learning for routing problems and combinatorial optimization in general, we refer to Mazyavkina et al (2020); Vesselinova et al (2020).…”
Section: Machine Learning For Vehicle Routing Problemsmentioning
confidence: 99%
“…Solving single-agent routing (scheduling) problems with RL. According to [26], the RL approaches to solving agent routing problems can be categorized into: (1) improvement heuristics learns to rewrite the complete solution iteratively to obtain a better solution [43,5,4,24]; (2) construction approach learns to construct a solution by sequentially assigning idle agents to unvisited cities until the full routing schedule (sequence) is constructed [3,28,20,19], and (3) hybrid approaches blending both approaches [17,7,21,1]. Typically, learning-based improvement or hybrid approaches have shown good performance since these can iteratively update the best solution until reaching the best one.…”
Section: Related Workmentioning
confidence: 99%
“…For example, Kong et al (2019) applies RL to knapsack and secretary problems, and Khalil et al (2017) uses RL to solve graph problems. Mazyavkina et al (2020) and Bengio et al (2020) provide an extensive survey on applications of ML and RL in combinatorial optimization. For the specific problem of shipping optimization, van Andel (2018) uses ML to consolidate shipments from nearby suppliers.…”
Section: Related Workmentioning
confidence: 99%