Proceedings of the Platform for Advanced Scientific Computing Conference 2021
DOI: 10.1145/3468267.3470574
|View full text |Cite
|
Sign up to set email alerts
|

Predictive, reactive and replication-based load balancing of tasks in Chameleon and sam(oa) 2

Abstract: Increasingly complex hardware architectures as well as numerical algorithms make balancing load in parallel numerical software for adaptive mesh refinement an inherently difficult task, especially if variability of system components and unpredictability of execution time comes into play. Yet, traditional predictive load balancing strategies are largely based on cost models that aim to predict the execution time of computational tasks. To address this fundamental weakness, we present a novel reactive load balan… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
6
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
1
1
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(6 citation statements)
references
References 26 publications
0
6
0
Order By: Relevance
“…Because of reactive task migration, we do call "offloading" tasks instead of "stealing," and reactive action is taken from the overloaded/slow processes. 16 As shown in Figure 4B, the decision time of offloading tasks (t k ) is earlier than work stealing in Figure 4A. A detailed example of how tasks are offloaded reactively can be found in Appendix A.…”
Section: Reactive Load Balancingmentioning
confidence: 98%
See 4 more Smart Citations
“…Because of reactive task migration, we do call "offloading" tasks instead of "stealing," and reactive action is taken from the overloaded/slow processes. 16 As shown in Figure 4B, the decision time of offloading tasks (t k ) is earlier than work stealing in Figure 4A. A detailed example of how tasks are offloaded reactively can be found in Appendix A.…”
Section: Reactive Load Balancingmentioning
confidence: 98%
“…11,15 The following idea is task replication that aims at tackling unexpected performance variability. 16 However, this is difficult to know how many tasks should be offloaded at once and which processes are truly underloaded/fast in a short period. Without prior load knowledge, replication strategies need to fix the target process for replicas, such as left/right neighbor ranks.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations