The GPU-based parallel Ant Colony System

Skinderowicz, Rafał

doi:10.1016/j.jpdc.2016.04.014

Cited by 48 publications

(26 citation statements)

References 28 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In an ACO, a population of agents (ants) construct, in parallel, a set of solutions to the optimization problem begin tackled. Unfortunately, the inherent parallel nature of the ACO does not translate easily into an efficient GPU-based parallel implementation [7,12,39]. The difficulties arise partly from the fact that not all of the ACO computations are independent, e.g., pheromone trail updates; as well as from the computing restrictions inflicted by GPU architectures.…”

Section: Introductionmentioning

confidence: 99%

“…It is possible to use the current global best value instead of the iteration best [43]. It is worth noting, that in contrast to the Ant Colony System (ACS), parallelization is made simpler because the MMAS lacks a local pheromone update [39]. In fact, the pheromone trail values remain constant during the solution construction phase, allowing a beforehand computation of the product of the pheromone trails and the heuristic values required by Eq.…”

mentioning

confidence: 99%

See 1 more Smart Citation

Implementing a GPU-based parallel MAX–MIN Ant System

Skinderowicz

2020

Future Generation Computer Systems

Self Cite

View full text Add to dashboard Cite

The MAX-MIN Ant System (MMAS) is one of the best-known Ant Colony Optimization (ACO) algorithms proven to be efficient at finding satisfactory solutions to many difficult combinatorial optimization problems. The slow-down in Moore's law, and the availability of graphics processing units (GPUs) capable of conducting general-purpose computations at high speed, has sparked considerable research efforts into the development of GPU-based ACO implementations. In this paper, we discuss a range of novel ideas for improving the GPU-based parallel MMAS implementation, allowing it to better utilize the computing power offered by two subsequent Nvidia GPU architectures. Specifically, based on the weighted reservoir sampling algorithm we propose a novel parallel implementation of the node selection procedure, which is at the heart of the MMAS and other ACO algorithms. We also present a memory-efficient implementation of another keycomponent -the tabu list structure -which is used in the ACO's solution construction stage. The proposed implementations, combined with the existing approaches, lead to a total of six MMAS variants, which are evaluated on a set of Traveling Salesman Problem (TSP) instances ranging from 198 to 3,795 cities. The results show that our MMAS implementation is competitive with state-of-the-art GPU-based and multi-core CPU-based parallel ACO implementations: in fact, the times obtained for the Nvidia V100 Volta GPU were up to 7.18x and 21.79x smaller, respectively. The fastest of the proposed MMAS variants is able to generate over 1 million candidate solutions per second when solving a 1,002-city instance. Moreover, we show that, combined with the 2-opt local search heuristic, the proposed parallel MMAS finds high-quality solutions for the TSP instances with up to 18,512 nodes.

show abstract

Section: Introductionmentioning

confidence: 99%

mentioning

confidence: 99%

Implementing a GPU-based parallel MAX–MIN Ant System

Skinderowicz

2020

Future Generation Computer Systems

Self Cite

View full text Add to dashboard Cite

show abstract

“…Because the proposed fusion between the ACS and SA is problem-agnostic one could try to apply it to solve other difficult combinatorial optimization problems. The performance of the proposed algorithms in terms of computation time could also be improved with the help of parallel computations, as the ACS is susceptible to parallelization even with modern GPUs [41]. Acknowledgments: This research was supported in part by PL-Grid Infrastructure.…”

Section: Instancementioning

confidence: 99%

An improved Ant Colony System for the Sequential Ordering Problem

Skinderowicz

2017

Computers & Operations Research

Self Cite

View full text Add to dashboard Cite

It is not rare that the performance of one metaheuristic algorithm can be improved by incorporating ideas taken from another. In this article we present how Simulated Annealing (SA) can be used to improve the efficiency of the Ant Colony System (ACS) and Enhanced ACS when solving the Sequential Ordering Problem (SOP). Moreover, we show how the very same ideas can be applied to improve the convergence of a dedicated local search, i.e. the SOP-3-exchange algorithm. A statistical analysis of the proposed algorithms both in terms of finding suitable parameter values and the quality of the generated solutions is presented based on a series of computational experiments conducted on SOP instances from the well-known TSPLIB and SOPLIB2006 repositories. The proposed ACS-SA and EACS-SA algorithms often generate solutions of better quality than the ACS and EACS, respectively. Moreover, the EACS-SA algorithm combined with the proposed SOP-3-exchange-SA local search was able to find 10 new best solutions for the SOP instances from the SOPLIB2006 repository, thus improving the state-of-the-art results as known from the literature. Overall, the best known or improved solutions were found in 41 out of 48 cases.Comment: 30 pages, 8 tables, 11 figure

show abstract

“…If > 0 , we use the formula (2) to select the next node, this method is also called roulette select method. Skinderowicz R [9] based on the characteristics of the thread bundle proposed a reduction algorithm, the maximum size of the calculation is limited to 32. This is because, in the CUDA architecture, the number of threads in a thread bundle is 32.…”

Section: Selection Of Next Nodementioning

confidence: 99%

An Approximate Algorithm for the Steiner Tree Problem based on Ant Colony Algorithm

Jia¹,

Huang²

2017

dtcse

View full text Add to dashboard Cite

Ant colony optimization (ACO) is a well-known heuristic intelligent method which is used to solve combinatorial optimization problem and can obtain high-quality results in a reasonable execution time. Steiner Tree Problem (STP) is a classic NPcomplete problem. The computational complexity of the exact algorithm increases exponentially with the increment of the size of the problem. In this paper, we propose an ant colony optimization algorithm with data parallelism to solve the STP. The simulation results show that the algorithm has good performance in result quality.

show abstract

The GPU-based parallel Ant Colony System

Cited by 48 publications

References 28 publications

Implementing a GPU-based parallel MAX–MIN Ant System

Implementing a GPU-based parallel MAX–MIN Ant System

An improved Ant Colony System for the Sequential Ordering Problem

An Approximate Algorithm for the Steiner Tree Problem based on Ant Colony Algorithm

Contact Info

Product

Resources

About