Some GPU Algorithms for Graph Connected Components and Spanning Tree

Soman, Jyothish; Kothapalli, Kishore; Narayanan, P. J.

doi:10.1142/s0129626410000272

Cited by 21 publications

(20 citation statements)

References 17 publications

“…Similarly, our hybrid algorithm for graph connected components is faster by 25% compared to the best known GPU implementation [26].…”

mentioning

confidence: 88%

“…On such regular applications, GPUs can outperform a single-core CPU performance by a large factor on average. In recent times, researchers have studied how GPUs perform on irregular computations such as list ranking [33], [21], connected components [26], among others. It is to be noted that in these cases, the speed-up compared to a single core CPU performance is only of the order of 10 or less.…”

Section: Introduction

mentioning

confidence: 99%

Hybrid algorithms for list ranking and graph connected components

Banerjee

Kothapalli

2011

2011 18th International Conference on High Performance Computing

Self Cite

The advent of multicore and many-core architectures saw them being deployed to speed up computations across several disciplines and application areas. Prominent examples include semi-numerical algorithms such as sorting, graph algorithms, image processing, scientific computations, and the like. In particular, using GPUs for general purpose computations has attracted a lot of attention given that GPUs can deliver more than one TFLOP of computing power at very low prices. In this work, we use a new model of multicore computing called hybrid multicore computing, where the computation is performed simultaneously on a control device, such as a CPU, and an accelerator, such as a GPU. To this end, we use two case studies to explore the algorithmic and analytical issues in hybrid multicore computing. Our case studies involve two different ways of designing hybrid multicore algorithms. The main contribution of this paper is to address the issues related to the design of hybrid solutions. We show our hybrid algorithm for list ranking is faster by 50% compared to the best known implementation [Z. Wei, J. JaJa; IPDPS 2010]. Similarly, our hybrid algorithm for graph connected components is faster by 25% compared to the best known GPU implementation [26].

“…Popular parallel algorithms in the PRAM model include the algorithm of Shiloach and Vishkin [41] and its variants by Greiner [19]. On GPUs, a variant of Shiloach and Vishkin [41] is used by Soman et al [42]. A heterogeneous execution of this algorithm on a CPU+GPU platform with an improvement of 35% on average is shown in [6].…”

Section: Related Work

mentioning

confidence: 99%

“…However, because of the irregular nature of operations involved, this workload is often difficult to implement on most modern parallel architectures. Efficient implementations of the Shiloach and Vishkin algorithm are known to exist for a variety of parallel architectures including symmetric multiprocessors [5], Cray and CM2 [19], GPUs [42], and also on CPU+GPU systems [6].…”

Section: Connected Components

mentioning

confidence: 99%