2021
DOI: 10.3390/su132212906
|View full text |Cite
|
Sign up to set email alerts
|

Solving Traveling Salesman Problem with Time Windows Using Hybrid Pointer Networks with Time Features

Abstract: This paper introduces a time efficient deep learning-based solution to the traveling salesman problem with time window (TSPTW). Our goal is to reduce the total tour length traveled by -*the agent without violating any time limitations. This will aid in decreasing the time required to supply any type of service, as well as lowering the emissions produced by automobiles, allowing our planet to recover from air pollution emissions. The proposed model is a variation of the pointer networks that has a better abilit… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 14 publications
0
1
0
Order By: Relevance
“…However, in traditional heuristic techniques, the optimum performance, or hand-engineered rules, heavily relied on human talent and experience, which left much room for future development in solution quality. To date, deep reinforcement learning (DRL) is being increasingly used to solve several combinatorial optimization problems, such as TSP, TWTSP, and VRPs [ 33 ]. Most models based on deep reinforcement learning are defined by the policy network of the encoder and the decoder structures.…”
Section: Related Workmentioning
confidence: 99%
“…However, in traditional heuristic techniques, the optimum performance, or hand-engineered rules, heavily relied on human talent and experience, which left much room for future development in solution quality. To date, deep reinforcement learning (DRL) is being increasingly used to solve several combinatorial optimization problems, such as TSP, TWTSP, and VRPs [ 33 ]. Most models based on deep reinforcement learning are defined by the policy network of the encoder and the decoder structures.…”
Section: Related Workmentioning
confidence: 99%