2013
DOI: 10.1016/j.parco.2013.05.004
|View full text |Cite
|
Sign up to set email alerts
|

Framework for a productive performance optimization

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
19
0

Year Published

2014
2014
2024
2024

Publication Types

Select...
4
3
3

Relationship

0
10

Authors

Journals

citations
Cited by 24 publications
(19 citation statements)
references
References 11 publications
0
19
0
Order By: Relevance
“…As a first step in the performance analysis, we have executed the airplane test case of 31.5M (coarse mesh) on 320 MPI processes using 4 nodes of the POWER9 cluster, running only on the CPUs. For this analysis we have used Extrae [30] to obtain a trace of the execution and Paraver [31] to visualize it (the overhead introduced by the tracing tools is under 4% [32] Figure 4 shows a timeline of a real execution of the airplane simulation. The X-axis represents the time, and the Y-axis represents the different MPI processes.…”
Section: Performance Characterizationmentioning
confidence: 99%
“…As a first step in the performance analysis, we have executed the airplane test case of 31.5M (coarse mesh) on 320 MPI processes using 4 nodes of the POWER9 cluster, running only on the CPUs. For this analysis we have used Extrae [30] to obtain a trace of the execution and Paraver [31] to visualize it (the overhead introduced by the tracing tools is under 4% [32] Figure 4 shows a timeline of a real execution of the airplane simulation. The X-axis represents the time, and the Y-axis represents the different MPI processes.…”
Section: Performance Characterizationmentioning
confidence: 99%
“…Extrae [10] is a tracing tool developed at BSC. It collects information such as PAPI counters, MPI and OpenMP calls during the execution of an application.…”
Section: B Performance Toolsmentioning
confidence: 99%
“…In this section we study a trace of the Alya simulation gathered on one node of the Thunder cluster, introduced in Section 4.2 (i.e., running with 96 MPI processes in the same cluster evaluated in Section 4). We use Extrae [22] to obtain a performance trace and then Paraver [19] to visualize it. In this simulation 4 • 10 5 particles where injected in the respiratory system during the first time step.…”
Section: Profile and Performance Analysismentioning
confidence: 99%