2018
DOI: 10.1007/978-3-319-92040-5_20

TaskGenX: A Hardware-Software Proposal for Accelerating Task Parallelism

Abstract: As chip multi-processors (CMPs) are becoming more and more complex, software solutions such as parallel programming models are attracting a lot of attention. Task-based parallel programming models offer an appealing approach to utilize complex CMPs. However, the increasing number of cores on modern CMPs is pushing research towards the use of fine grained parallelism. Task-based programming models need to be able to handle such workloads and offer performance and scalability. Using specialized hardware for boos…

Cited by 4 publications (5 citation statements)
References: 26 publications
“…The granularity is then a balance between the degree of parallelism and the RS overhead. For that reason, several researches are conducted to delegate partially or totally the RS system to the hardware with the objective of relieving the worker threads, as in Chronaki et al (2018).…”
Section: Task-based Parallelization (mentioning)
confidence: 99%
“…The granularity is then a balance between the degree of parallelism and the RS overhead. For that reason, several researches are conducted to delegate partially or totally the RS system to the hardware with the objective of relieving the worker threads, as in Chronaki et al (2018).…”
Section: Task-based Parallelization (mentioning)
confidence: 99%
“…Chronaki et al [31] study how to efficiently assign task to cores in asymmetric multiprocessor systems by exploiting the critical path of the TDG. Moreover, the same authors target the bottleneck of task creation and dependence analysis by offloading the TDG construction to an already existing specialized hardware and studying how the runtime controls this hardware [30]. LibPRISM [93] is used to autotune the prefetcher policy and SMT levels in OpenMP parallel applications by profiling online the parallel regions or task characteristics.…”
Section: Exploiting Runtime System Information In the Architecture (mentioning)
confidence: 99%
“…ISC 2018. [15] The publication presents the design and requirements of a hardware manager to accelerate the task management in the software runtimes.…”
Section: Thesis Publications and Contributions (mentioning)
confidence: 99%
“…ISC 2018. [15] In this work, the DDAST implementation has been used to characterize the software runtime overheads and define the hardware manager requirements.…”
Section: Publications (mentioning)
confidence: 99%