2010
DOI: 10.1142/s0129626410000272
|View full text |Cite
|
Sign up to set email alerts
|

Some Gpu Algorithms for Graph Connected Components and Spanning Tree

Abstract: Graphics Processing Units (GPU) are application specific accelerators which provide high performance to cost ratio and are widely available and used, hence places them as a ubiquitous accelerator. A computing paradigm based on the same is the general purpose computing on the GPU (GPGPU) model. The GPU due to its graphics lineage is better suited for the data-parallel, data-regular algorithms. The hardware architecture of the GPU is not suitable for the data parallel but data irregular algorithms such as graph … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
20
0

Year Published

2011
2011
2018
2018

Publication Types

Select...
2
2
1

Relationship

2
3

Authors

Journals

citations
Cited by 21 publications
(20 citation statements)
references
References 17 publications
0
20
0
Order By: Relevance
“…Similarly, our hybrid algorithm for graph connected components is faster by 25% compared to the best known GPU implementation [26].…”
mentioning
confidence: 88%
See 1 more Smart Citation
“…Similarly, our hybrid algorithm for graph connected components is faster by 25% compared to the best known GPU implementation [26].…”
mentioning
confidence: 88%
“…On such regular applications, GPUs can outperform a single-core CPU performance by a large factor on average. In recent times, researchers have studied how GPUs perform on irregular computations such as list ranking [33], [21], connected components [26], among others. It is to be noted that in these cases, the speed-up compared to a single core CPU performance is only of the order of 10 or less.…”
Section: Introductionmentioning
confidence: 99%
“…Popular parallel algorithms in the PRAM model include the algorithm of Shiloach and Vishkin [41] and its variants by Greiner [19]. On GPUs, a variant of Shiloach and Vishkin [41] is used by Soman et al [42]. A heterogeneous execution of this algorithm on a CPU+GPU platform with an improvement of 35% on average is shown in [6].…”
Section: Related Workmentioning
confidence: 99%
“…However, because of the irregular nature of operations involved, this workload is often difficult to implement on most modern parallel architectures. Efficient implementations of the Shiloach and Vishkin algorithm are known to exist for a variety of parallel architectures including symmetric multiprocessors [5], Cray and CM2 [19], GPUs [42], and also on CPU+GPU systems [6].…”
Section: Connected Componentsmentioning
confidence: 99%
See 1 more Smart Citation