2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS) 2022
DOI: 10.1109/ipdps53621.2022.00042
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Phase Task-Based HPC Applications: Quickly Learning how to Run Fast

Abstract: Parallel applications performance strongly depends on the number of resources. Although adding new nodes usually reduces execution time, excessive amounts are often detrimental as they incur substantial communication overhead, which is difficult to anticipate. Characteristics like network contention, data distribution methods, synchronizations, and how communications and computations overlap generally impact the performance. Finding the correct number of resources can thus be particularly tricky for multi-phas… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
2
0
1

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 24 publications
0
2
0
1
Order By: Relevance
“…Figure 11 -depicts three iterations of ExaGeoStat package where the x -axis is the time, and the y-axis has the aggregated resource type utilization per node. The different colors correspond to different phases: the yellow ones are the generation, while the green ones are the tasks the factorization, a small number of tasks in gray correspond to the other three phases [15].…”
Section: Related Workmentioning
confidence: 99%
“…Figure 11 -depicts three iterations of ExaGeoStat package where the x -axis is the time, and the y-axis has the aggregated resource type utilization per node. The different colors correspond to different phases: the yellow ones are the generation, while the green ones are the tasks the factorization, a small number of tasks in gray correspond to the other three phases [15].…”
Section: Related Workmentioning
confidence: 99%
“…Embora o paradigma baseado em tarefas atenue a sobrecarga de comunicação, efeitos imprevisíveis (contenção, sincronizações) continuam a ser possíveis e particularmente difíceis de modelar a priori, especialmente quando se explora conjuntos heterogêneos de nós [Nesi et al 2022]. Como a quantidade de nós é diretamente relacionada ao desempenho e a estes efeitos, métodos que aprendem e se adaptam são desejáveis.…”
Section: Introductionunclassified
“…The MLOE/MMOM tools are used to assess the loss of the prediction efficiency by using the approximated or misspecified covariance models (Hong et al, 2021). The Fisher information matrix provides information about the importance of a single observation in estimating unknown statistical parameters, i.e., uncertainty Intel Processor Intel Haswell (Abdulah et al, 2018a(Abdulah et al, ,b, 2019, Intel Broadwell (Abdulah et al, 2018a,b), Intel KNL (Abdulah et al, 2018a,b), Intel Skylake (Abdulah et al, 2018b(Abdulah et al, ,2019Salvaña et al, 2022a), and Intel Cascade Lake (Mondal et al, 2022;Salvaña et al, 2022a) AMD Processor AMD EPYC (Salvaña et al, 2022a), and AMD Milan (Mondal et al,2022) ARM Processor ThunderX2 ARM (Salvaña et al,2022a) NVIDIA GPUs NVIDIA K80s (Abdulah et al, 2018a(Abdulah et al, , 2019, NVIDIA P100 (Abdulah et al,2019), and NVIDIA V100 (Abdulah et al, 2019;Salvaña et al, 2022a) Supercomputers KAUST Shaheen-II CrayXC40 (Abdulah et al, 2018a(Abdulah et al, ,b, 2019(Abdulah et al, , 2021Mondal et al,2022;Salvaña et al,2022a,b), HLRS HPE APOLLO (HAWK) , ORNL Summit (Abdulah et al, 2021), INRIA Grid5000 (Nesi et al,2021(Nesi et al, , 2022, and LNCC Santos Dumont supercomputer (SD) (Nesi et al,2022), and Riken Fugaku Fujitsu A64FX (Cao et al,2022) Distributed-Memory Systems…”
Section: Introductionmentioning
confidence: 99%