2012 IEEE 26th International Parallel and Distributed Processing Symposium 2012
DOI: 10.1109/ipdps.2012.120

Scalable Critical-Path Based Performance Analysis

Abstract: The critical path, which describes the longest execution sequence without wait states in a parallel program, identifies the activities that determine the overall program runtime. Combining knowledge of the critical path with traditional parallel profiles, we have defined a set of compact performance indicators that help answer a variety of important performance-analysis questions, such as identifying load imbalance, quantifying the impact of imbalance on runtime, and characterizing resource consumptio…
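To make the definition in the abstract concrete, the following minimal sketch finds the longest duration-weighted chain in a small dependency graph of activities. It is a generic longest-path computation on a toy example, not the paper's own algorithm; the function name `critical_path`, the activity names, and the durations are all hypothetical and chosen only for illustration.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def critical_path(durations, deps):
    """Return (length, path) of the longest duration-weighted chain.

    durations: dict mapping activity name -> time spent in that activity
    deps: dict mapping activity name -> set of activities that must finish first
    """
    graph = {act: deps.get(act, set()) for act in durations}
    finish = {}  # finish time of each activity when started as early as possible
    prev = {}    # predecessor on the longest chain ending at each activity

    # static_order() yields every activity after all of its predecessors.
    for act in TopologicalSorter(graph).static_order():
        best = max(graph[act], key=lambda p: finish[p], default=None)
        prev[act] = best
        start = finish[best] if best is not None else 0.0
        finish[act] = start + durations[act]

    # Walk back from the activity that finishes last.
    node = max(finish, key=finish.get)
    length = finish[node]
    path = []
    while node is not None:
        path.append(node)
        node = prev[node]
    path.reverse()
    return length, path


# Hypothetical two-rank example: both ranks synchronize after "compute",
# so rank 1 (1 s of work) waits for rank 0 (4 s of work).
durations = {"r0_compute": 4, "r1_compute": 1, "r0_post": 2, "r1_post": 3}
deps = {
    "r0_post": {"r0_compute", "r1_compute"},
    "r1_post": {"r0_compute", "r1_compute"},
}
length, path = critical_path(durations, deps)
print(length, path)
```

In this toy case the chain runs through rank 0's compute phase and then rank 1's post-processing, which shows that the critical path can switch between processes at a synchronization point and that the 1 s compute phase on rank 1 does not determine the runtime.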

Cited by 47 publications (24 citation statements)
References 20 publications
“…Based on the initial profile results, the measurement configuration can be optimized to reduce measurement perturbation, for example, by filtering small but frequently executed functions. In-depth analyses of the performance behavior can then be performed by collecting and automatically analyzing event traces, which allow one to distinguish between wait states and actual communication or synchronization time as well as to determine their root causes and activities on the critical path (Böhme et al., 2010, 2012).…”
Section: Performance Analysis
mentioning
confidence: 99%
“…Power Wall The potential of exploiting the critical path has been already evaluated in MPI programs [6]. By using the knowledge of the critical path and combining it with straightforward profile-based techniques, a set of compact performance indicators that describe important performance-related questions, such as load imbalance, resource consumption or dynamic workload, an efficient scheduling can be derived.…”
Section: Memory Wall
mentioning
confidence: 99%
“…Since the task-dependency graph can be generated ahead of time, it is possible to delay memory allocation to tasks until they start executing. This virtual resource allocation allows other tasks in the critical path to take advantage of this extra memory. Power Wall The potential of exploiting the critical path has been already evaluated in MPI programs [6]. By using the knowledge of the critical path and combining it with straightforward profile-based techniques, a set of compact performance indicators that describe important performance-related questions, such as load imbalance, resource consumption or dynamic workload, an efficient scheduling can be derived.…”
mentioning
confidence: 99%
“…In addition to the sheer scale, traces can be highly complex, making automatic analysis challenging. While there exist solutions to some problems, like identifying the critical execution path [36,7], in many cases the root causes of performance problems are more subtle than, for example, a single late process. Moreover, fully automatic techniques have been of limited use.…”
Section: Introduction
mentioning
confidence: 99%