2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis 2010
DOI: 10.1109/sc.2010.47
|View full text |Cite
|
Sign up to set email alerts
|

Scalable Identification of Load Imbalance in Parallel Executions Using Call Path Profiles

Abstract: Abstract-Applications must scale well to make efficient use of today's class of petascale computers, which contain hundreds of thousands of processor cores. Inefficiencies that do not even appear in modest-scale executions can become major bottlenecks in large-scale executions. Because scaling problems are often difficult to diagnose, there is a critical need for scalable tools that guide scientists to the root causes of scaling problems.Load imbalance is one of the most common scaling problems. To provide act… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
28
0

Year Published

2011
2011
2021
2021

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 65 publications
(28 citation statements)
references
References 24 publications
0
28
0
Order By: Relevance
“…Efficient, scalable measurement of load [12,24] identifies whether load imbalance is a problem for a particular application. Imbalance attribution [23] provides insight into the source code locations that cause imbalance. Our load model takes advantage of existing tools [10] and their measurements, and combines them with knowledge of the application elements and their interactions.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Efficient, scalable measurement of load [12,24] identifies whether load imbalance is a problem for a particular application. Imbalance attribution [23] provides insight into the source code locations that cause imbalance. Our load model takes advantage of existing tools [10] and their measurements, and combines them with knowledge of the application elements and their interactions.…”
Section: Related Workmentioning
confidence: 99%
“…These tools can provide insight into the source location that caused an imbalance [23] and into the distribution of the load, but this knowledge alone is insufficient to correct the load. Existing load metrics do not account for constraints on rebalancing imposed by application elements and their interaction.…”
Section: Introductionmentioning
confidence: 99%
“…CrayPat [3] calculates imbalance metrics from profiles that provide measures of absolute and relative load imbalance. HPC-TOOLKIT [23] attributes the costs of idleness at global synchronization points to overloaded call paths, highlighting imbalances in call-path profiles. However, profiling-based approaches, which aggregate performance data along the time dimension, can generally express only static imbalances reliably and do not capture dynamic load shifts between processes over time.…”
Section: Related Workmentioning
confidence: 99%
“…Prior work has investigated the parallel measurement of per-process load balance data and its attribution to source code [6,10,18,19]. Two of these techniques complement this work by enabling scalable parallel data collection for the type of per-process data we collect here, and by allowing code to be automatically sliced into phases and regions based on callpaths.…”
Section: Related Workmentioning
confidence: 99%