2011
DOI: 10.1007/s10586-010-0151-6
|View full text |Cite
|
Sign up to set email alerts
|

Optimizing dataflow applications on heterogeneous environments

Abstract: The increases in multi-core processor parallelism and in the flexibility of many-core accelerator processors, such as GPUs, have turned traditional SMP systems into hierarchical, heterogeneous computing environments. Fully exploiting these improvements in parallel system design remains an open problem. Moreover, most of the current tools for the development of parallel applications for hierarchical systems concentrate on the use of only a single processor type (e.g., accelerators) and do not coordinate several… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
20
0
4

Year Published

2013
2013
2018
2018

Publication Types

Select...
6
3

Relationship

5
4

Authors

Journals

citations
Cited by 22 publications
(24 citation statements)
references
References 36 publications
0
20
0
4
Order By: Relevance
“…Execution on distributed CPU-GPU platforms has been the target of a number of projects [5], [6], [15], [16], [8], [17], [18], [20]. Ravi et al [17] developed compiler based translation of generalized reductions to CPU-GPU systems.…”
Section: Related Workmentioning
confidence: 99%
“…Execution on distributed CPU-GPU platforms has been the target of a number of projects [5], [6], [15], [16], [8], [17], [18], [20]. Ravi et al [17] developed compiler based translation of generalized reductions to CPU-GPU systems.…”
Section: Related Workmentioning
confidence: 99%
“…There are several recent efforts on task scheduling for hybrid machines [17,19,24,26,9,27,15,28]. Most of the previous works deal with task mapping for applications in which operations attain similar speedups when executed on a GPU vs a CPU.…”
Section: Related Workmentioning
confidence: 99%
“…Efficient execution of applications on distributed CPUGPU equipped platforms has been an objective of several projects [23], [24], [25], [26], [22], [29], [30]. Ravi et al [24], [26] proposes techniques for automatic translation of generalized reductions to CPU-GPU environments via compile-time techniques, which are coupled with runtime support to coordinate execution.…”
Section: Related Workmentioning
confidence: 99%