2021
DOI: 10.1371/journal.pone.0260995
|View full text |Cite
|
Sign up to set email alerts
|

Hybrid pointer networks for traveling salesman problems optimization

Abstract: In this work, we proposed a hybrid pointer network (HPN), an end-to-end deep reinforcement learning architecture is provided to tackle the travelling salesman problem (TSP). HPN builds upon graph pointer networks, an extension of pointer networks with an additional graph embedding layer. HPN combines the graph embedding layer with the transformer’s encoder to produce multiple embeddings for the feature context. We conducted extensive experimental work to compare HPN and Graph pointer network (GPN). For the sac… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 12 publications
(6 citation statements)
references
References 14 publications
0
6
0
Order By: Relevance
“…In this section, we reformulate the PDP as a reinforcement learning (RL) problem, which is followed by the development of a model based on the encoder and decoder structure to learn node selection process for solution construction empowered by Hybrid pointer networks (HPN) [ 10 ].…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…In this section, we reformulate the PDP as a reinforcement learning (RL) problem, which is followed by the development of a model based on the encoder and decoder structure to learn node selection process for solution construction empowered by Hybrid pointer networks (HPN) [ 10 ].…”
Section: Methodsmentioning
confidence: 99%
“…In order to learn policy π , following several previous studies [ 10 ] where the HPN model was built upon the pointer networks, we built a policy network with an encoder-decoder structure. Given the features of PDP, the HPN was expected to learn the link between the nodes of various roles, allowing the precedence constraint to be captured intrinsically.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…A DQN could also be given a graph that describes the paths between cities in the U.S. and the cost of traveling between any two adjacent cities (i.e. the traveling salesman problem [7]). If a given graph has N cities, the computational complexity of this NP-complete problem is O(N 22N ).…”
Section: Introductionmentioning
confidence: 99%
“…Ma et al (2019) [15] used hierarchical RL for training in a graph pointer network for TSP. Stohy et al (2021) [16] used an actor-critic method for training in a hybrid pointer network. These algorithms use RL to train neural networks by testing a large number of small-scale TSP datasets, which consumes a great deal of time and resources, and the accuracy of the solution is not ideal.…”
Section: Introductionmentioning
confidence: 99%