Proceedings of the 5th Workshop on Irregular Applications: Architectures and Algorithms 2015
DOI: 10.1145/2833179.2833189
|View full text |Cite
|
Sign up to set email alerts
|

Dynamic parallelism for simple and efficient GPU graph algorithms

Abstract: Dynamic parallelism allows GPU kernels to launch additional kernels at runtime directly from the GPU. In this paper we show that dynamic parallelism enables relatively simple high-performance graph algorithms for GPUs. We present breadth-first search (BFS) and single-source shortest paths (SSSP) algorithms that use dynamic parallelism to adapt to the irregular and data-driven nature of these problems. Our approach results in simple code that closely follows the highlevel description of the algorithms but yield… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
15
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 15 publications
(15 citation statements)
references
References 10 publications
0
15
0
Order By: Relevance
“…Concerning the programmability, our results contrast with the results of Zhang et al, where the use of CDP simplified the development of GPU‐based graph algorithms. According to our experience, using CDP is challenging and brings complexity to the code.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…Concerning the programmability, our results contrast with the results of Zhang et al, where the use of CDP simplified the development of GPU‐based graph algorithms. According to our experience, using CDP is challenging and brings complexity to the code.…”
Section: Discussionmentioning
confidence: 99%
“…Although results show speedups up to 2.73×, the use of CDP causes a slowdown on the overall performance of the benchmark algorithms. CDP‐based algorithms for breadth‐first search (BFS) and single‐source shortest path (SSSP) are presented in . According to the authors, CDP can simplify the development of GPU‐based graph algorithms, because the use of CDP leads to a simpler code closer to its high‐level description.…”
Section: Background and Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The parallelization of irregular applications using CDP has received little attention in the literature. Particularly, CDP has been used for processing graphs, clustering, simulations, and backtracking algorithms [1,7,11,13,14]. According to related works, CDP is beneficial for processing applications whose data are hierarchically arranged.…”
Section: Related Work On Cuda Dynamic Parallelismmentioning
confidence: 99%
“…Tests were performed on an NVIDIA K20c GPU. In paper [30] authors analyzed performance of a DP-enabled algorithms for breadth-first search (BFS) and single-source shortest paths (SSSP) algorithms compared to other existing implementations showing performance better than some but not the best (compared to algorithms with advanced queueing for SSSP) results. Authors of paper [15] state that they obtained over 2.6 speedups for SSSP and over 1.4 for sparse matrix-vector multiplication (SpMV) codes compared to basic implementations without DP.…”
Section: Dynamic Parallelismmentioning
confidence: 99%