2004
DOI: 10.1504/ijhpcn.2004.007564

Towards optimisation of OpenMP codes for synchronisation and data reuse

Abstract: In this paper, we present the compiler transformation of OpenMP code to an ordered collection of tasks, and the compile-time as well as runtime mapping of the resulting task graph to threads for data reuse. The ordering of tasks is relaxed where possible so that the code may be executed in a more loosely synchronous fashion. Our current implementation uses a runtime system that permits tasks to begin execution as soon as their predecessors have completed. A comparison of the performance of two example programs…
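The execution rule described in the abstract, where a task may start once all of its predecessors have completed, can be illustrated with a small sketch. This is not the paper's runtime system: it uses Python threads rather than OpenMP threads, and the function name `run_task_graph`, the `tasks` and `deps` dictionaries, and the diamond-shaped example graph are all hypothetical names chosen for illustration.

```python
from concurrent.futures import FIRST_COMPLETED, ThreadPoolExecutor, wait


def run_task_graph(tasks, deps, workers=4):
    """Run each task as soon as all of its predecessors have completed.

    tasks: dict mapping a task name to a zero-argument callable (hypothetical API).
    deps:  dict mapping a task name to the names of tasks it must wait for.
    Returns the task names in the order their completion was observed.
    """
    # Number of unfinished predecessors per task, and the reverse edges.
    remaining = {name: len(set(deps.get(name, ()))) for name in tasks}
    successors = {name: [] for name in tasks}
    for name in tasks:
        for pred in set(deps.get(name, ())):
            successors[pred].append(name)

    finished = []
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # Tasks without predecessors can start immediately.
        running = {pool.submit(tasks[n]): n for n in tasks if remaining[n] == 0}
        while running:
            done, _ = wait(list(running), return_when=FIRST_COMPLETED)
            for future in done:
                name = running.pop(future)
                future.result()  # re-raise any exception from the task
                finished.append(name)
                # Release successors whose last predecessor just finished.
                for succ in successors[name]:
                    remaining[succ] -= 1
                    if remaining[succ] == 0:
                        running[pool.submit(tasks[succ])] = succ

    if len(finished) != len(tasks):
        raise ValueError("dependency cycle: some tasks never became ready")
    return finished


if __name__ == "__main__":
    # Diamond graph: b and c depend on a, and d depends on both b and c.
    example_tasks = {n: (lambda n=n: print("running", n)) for n in "abcd"}
    example_deps = {"b": ["a"], "c": ["a"], "d": ["b", "c"]}
    print(run_task_graph(example_tasks, example_deps))
```

In this sketch there is no global barrier between phases: `b` and `c` may run concurrently once `a` is done, and `d` waits only for those two tasks. That is one way to read the "more loosely synchronous" execution the abstract mentions; the mapping of tasks to threads for data reuse is not modelled here.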

Cited by 14 publications
(1 citation statement)
References 35 publications
“…We create private instances of sub-arrays to spread computation among threads to ensure the data locality, where this study has been discussed (Liu et al, 2003). Programs written in SPMD style has also been shown to provide scalable performance, which is superior to a straightforward parallelisation of loop (Weng and Chapman, 2004).…”
Section: Bit-reversal Computation (mentioning)
confidence: 99%