2020 28th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP) 2020
DOI: 10.1109/pdp50117.2020.00032

Performance study of HPC applications on an Arm-based cluster using a generic efficiency model

Abstract: HPC systems and parallel applications are increasing their complexity. Therefore the possibility of easily study and project at large scale the performance of scientific applications is of paramount importance. In this paper we describe a performance analysis method and we apply it to four complex HPC applications. We perform our study on a pre-production HPC system powered by the latest Arm-based CPUs for HPC, the Marvell ThunderX2. For each application we spot inefficiencies and factors that limit their scal…

Cited by 9 publications (7 citation statements)
References 11 publications
“…From the description of the UEABS, the benchmark suite codes are "[...] scalable, currently relevant and publicly available[...], of a size which can realistically run on large systems and maintained into the future". Moreover, from previous works, we know that, in particular input sets, Alya can present a severe load imbalance problem [10] while CP2K shows a communication issue that can limit its scalability [1]. With this selection, we covered the two main metrics we want to demonstrate with TALP: Load balance and Communication efficiency.…”
Section: Environment and Methodology (mentioning)
confidence: 99%
“…Containers are used in HPC due to portability, composability, and BYOE advantages they offer [221]. Research on performance analysis of HPC applications based on ARM architectures using various ARM-based settings including mobile and cluster evaluated the performance of HPC applications based on portability, energy efficiency, and scalability [222]- [226]. Standard containers like Docker are not suitable for HPC, and this paved the way for the development of HPC-specific container technologies like Singularity [102], Shifter [227], Charliecloud [228], and Sarus [229].…”
Section: A. Containers in HPC (mentioning)
confidence: 99%
“…This methodology is independent of the tool being used for the analysis and defines a set of performance metrics. These metrics are well defined, accepted by the community, and meaningful, pointing the analysts to the main factors affecting the performance and scalability of the code [49,19]. In this paper, we use some of these metrics as they allow us to compare the different LAMMPS benchmarks using a common ground.…”
Section: Performance Tools and Efficiency Metrics (mentioning)
confidence: 99%