2016
DOI: 10.1109/tc.2015.2417526
|View full text |Cite
|
Sign up to set email alerts
|

DwarfCode: A Performance Prediction Tool for Parallel Applications

Abstract: We present DwarfCode, a performance prediction tool for MPI applications on diverse computing platforms. The goal is to accurately predict the running time of applications for task scheduling and job migration. First, DwarfCode collects the execution traces to record the computing and communication events. Then, it merges the traces from different processes into a single trace. After that, DwarfCode identifies and compresses the repeating patterns in the final trace to shrink the size of the events. Finally, a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0
1

Year Published

2016
2016
2023
2023

Publication Types

Select...
3
3
2

Relationship

3
5

Authors

Journals

citations
Cited by 27 publications
(10 citation statements)
references
References 42 publications
0
9
0
1
Order By: Relevance
“…A wide range of techniques have been proposed to accelerate convolution operations [1], [2], [3], [4], [5], [6], [7], [8]. Among these methods, general matrix multiplication (GEMM) [6], [7], fast fourier transform (FFT) [2] and winograd [3] methods are the broadly adopted ones.…”
Section: A Column Reuse Optimization 1) Standard Convolutionmentioning
confidence: 99%
“…A wide range of techniques have been proposed to accelerate convolution operations [1], [2], [3], [4], [5], [6], [7], [8]. Among these methods, general matrix multiplication (GEMM) [6], [7], fast fourier transform (FFT) [2] and winograd [3] methods are the broadly adopted ones.…”
Section: A Column Reuse Optimization 1) Standard Convolutionmentioning
confidence: 99%
“…It seems that these applications are not suitable for heterogeneous platforms. To solve this problem, a promising approach is to make the translator can predict appli-cation performances on different platforms [35,36,32,6], and automatically decide whether to offload or not.…”
Section: Performance Of Oao Versionmentioning
confidence: 99%
“…Trace-based methods: Trace-driven methods, which are frequently used in simulators [8], [7] and benchmark generation tools [24], [25], can capture detailed performance behavior and model the performance of parallel programs automatically. However, trace-driven methods have some limitations.…”
Section: Related Workmentioning
confidence: 99%