2011 18th International Conference on High Performance Computing
DOI: 10.1109/hipc.2011.6152655

Hybrid algorithms for list ranking and graph connected components

Abstract: The advent of multicore and many-core architectures has seen them deployed to speed up computations across several disciplines and application areas. Prominent examples include semi-numerical algorithms such as sorting, graph algorithms, image processing, scientific computations, and the like. In particular, using GPUs for general-purpose computations has attracted a lot of attention, given that GPUs can deliver more than one TFLOP of computing power at very low prices. In this work, we use a new model of multi…

Cited by 32 publications (15 citation statements)
References 24 publications
“…• On the connected components problem, we get an average 20% improvement over the best known result on an identical platform [6]. A small change to the algorithm can also build a spanning tree of a graph with very little extra time.…”
Section: Our Results (mentioning)
confidence: 91%
“…On GPUs, a variant of Shiloach and Vishkin [41] is used by Soman et al. [42]. A heterogeneous execution of this algorithm on a CPU+GPU platform with an improvement of 35% on average is shown in [6].…”
Section: Related Work (mentioning)
confidence: 98%
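The statement above refers to Shiloach-Vishkin-style connected components. As a rough illustration only, the following is a sequential Python sketch of the hook-and-shortcut idea behind that family of algorithms. It is not the hybrid CPU+GPU algorithm of the cited paper, and the names `connected_components`, `num_vertices`, and `edges` are hypothetical.

```python
def connected_components(num_vertices, edges):
    """Label every vertex with a representative of its component.

    Simplified, sequential hook-and-shortcut sketch (an assumption for
    illustration, not the cited paper's hybrid algorithm).
    """
    parent = list(range(num_vertices))  # every vertex starts as its own root
    changed = True
    while changed:
        changed = False
        # Hooking: for an edge joining two different trees, attach the
        # root with the larger label under the smaller label.
        for u, v in edges:
            pu, pv = parent[u], parent[v]
            if pu == pv:
                continue
            hi, lo = max(pu, pv), min(pu, pv)
            if parent[hi] == hi:  # only hook a vertex that is still a root
                parent[hi] = lo
                changed = True
        # Shortcutting: pointer jumping until every vertex points at a root.
        for v in range(num_vertices):
            while parent[v] != parent[parent[v]]:
                parent[v] = parent[parent[v]]
    return parent


labels = connected_components(6, [(0, 1), (1, 2), (3, 4)])
print(labels)
```

In this sketch each successful hook merges two distinct trees, so recording the edge that triggers each hook would give a spanning forest at little extra cost, which is in the spirit of the spanning-tree remark quoted earlier.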
“…A single cluster node is defined by the number of CPU cores (line 6), the communication performance of the node-internal interconnection network (lines 7-8) and the compute performance of the processor cores (line 9). Additionally, each cluster node contains a list of zero or more submachines (lines 10-17), where each submachine has a unique identifier and a submachine type, which is used to determine whether a given parallel task is executable on this submachine. The compute performance of a submachine is specified by the average execution time of an arithmetic instruction (lines 12 and 16).…”
Section: Specification Of Hybrid Platforms (mentioning)
confidence: 99%
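The platform description quoted above could be modelled roughly as follows. This is a hypothetical Python sketch, not the specification format of the citing paper. All class and field names are assumptions, as are the split of "communication performance" into latency and bandwidth and the unit used for core performance.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Submachine:
    identifier: str              # unique identifier of the submachine
    kind: str                    # submachine type, e.g. "gpu" (hypothetical value)
    avg_instruction_time: float  # average time of one arithmetic instruction


@dataclass
class ClusterNode:
    num_cores: int                    # number of CPU cores
    interconnect_latency: float       # assumed communication parameter
    interconnect_bandwidth: float     # assumed communication parameter
    core_avg_instruction_time: float  # assumed unit for core compute performance
    submachines: List[Submachine] = field(default_factory=list)  # zero or more

    def eligible_submachines(self, required_kind: str) -> List[Submachine]:
        """Return the submachines whose type matches the task's requirement."""
        return [s for s in self.submachines if s.kind == required_kind]


node = ClusterNode(
    num_cores=8,
    interconnect_latency=1e-6,
    interconnect_bandwidth=1e9,
    core_avg_instruction_time=1e-9,
    submachines=[
        Submachine("gpu0", "gpu", 5e-10),
        Submachine("fpga0", "fpga", 2e-9),
    ],
)
print([s.identifier for s in node.eligible_submachines("gpu")])
```

Matching on the submachine type mirrors the quoted rule that the type determines whether a parallel task is executable on a given submachine.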
“…8 shows an example schedule for a hybrid platform. A schedule consists of communication operations (lines 2-12) and the execution of parallel tasks (lines 13-19), where each operation has an associated start time and an associated finish time. A communication operation additionally contains the identifiers of the source and target nodes in the underlying task graph and the definition of the source and target execution units.…”
Section: Schedule Specification (mentioning)
confidence: 99%