Proceedings of the 48th International Conference on Parallel Processing 2019
DOI: 10.1145/3337821.3337913

Optimized Execution of Parallel Loops via User-Defined Scheduling Policies

Abstract: On-node parallelism continues to increase in importance for high-performance computing and most newly deployed supercomputers have tens of processor cores per node. These higher levels of on-node parallelism exacerbate the impact of load imbalance and locality in parallel computations, and current programming systems notably lack features to enable efficient use of these large numbers of cores or require users to modify codes significantly. Our work is motivated by the need to address application-specific load…

Citations: cited by 9 publications (14 citation statements)
References: 38 publications
“…Another direction of related work includes efforts that propose generic interfaces to allow users to implement their own loop scheduling techniques in different runtime libraries [28] [29] [30]. These efforts reduce the development challenges associated with the direct modification to the RTL source codes, i.e., developers can implement their scheduling technique via simplified, and ideally, well-documented interfaces.…”
Section: Related Work (mentioning)
confidence: 99%
“…However, these efforts do not exclude the need for extensive scheduling libraries to validate novel scheduling techniques and exploit the increased hardware parallelism of modern HPC systems. Therefore, such efforts [28], [29], [30] can be seen as potential methods that facilitate the development of another version of the LB4OMP scheduling library in the future.…”
Section: Related Work (mentioning)
confidence: 99%
“…Another direction of related work includes efforts that propose generic interfaces to allow users to implement their own loop scheduling techniques in different runtime libraries [25] [26] [27]. These efforts reduce the development challenges associated with the direct modification to the RTL source codes, i.e., developers can implement their scheduling technique via simplified, and ideally, well-documented interfaces.…”
Section: Related Work (mentioning)
confidence: 99%
“…The training and outreach activity is a cross-cutting effort which is supported by resources from SOLLVE and ECP Broader Engagement, with contributions by external collaborators, notably Lawrence Berkeley National Laboratory. A number of articles have also been published as part of the SOLLVE effort [76,77,78,79,80,81,82,83,84,85,86,87,88,89].…”
Section: Validation and Verification (V&V) (mentioning)
confidence: 99%