2010
DOI: 10.1002/cpe.v22:6
|View full text |Cite
|
Sign up to set email alerts
|

Untitled

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
22
0

Year Published

2010
2010
2020
2020

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 174 publications
(22 citation statements)
references
References 70 publications
0
22
0
Order By: Relevance
“…Several performance tools use measurement for the purposes of offline performance analysis, including TAU [32], HPCToolkit [1], Scalasca [38], Vampir [20], Extrae [27] and others. All are powerful and capable tools in their own right.…”
Section: Related Workmentioning
confidence: 99%
“…Several performance tools use measurement for the purposes of offline performance analysis, including TAU [32], HPCToolkit [1], Scalasca [38], Vampir [20], Extrae [27] and others. All are powerful and capable tools in their own right.…”
Section: Related Workmentioning
confidence: 99%
“…Numbers such as the branch prediction ratio or the L1 cache miss rate are of little help in finding sources of inefficiency. The best modern tools will either attribute such numbers to high-level code items, such as source lines or variables [1], or plot timelines [14]. This will show the user where inefficiencies are occurring but will not provide enough information to clearly identify the issue.…”
Section: Examplesmentioning
confidence: 99%
“…Popular existing tools, such as HPCToolkit [1], are certainly helpful for finding where possible bottlenecks might be, even down to the source line, or with data-centric tools [2], down to a variable. However, knowing that execution time, cache misses, or branch mispredictions are associated with a particular line of code or variable is not enough to identify many issues.…”
Section: Introductionmentioning
confidence: 99%
“…By checkpointing a program's state at regular intervals, the amount of lost computation is limited to the interval from the last checkpoint to the time of crash. There are several functional tools and libraries of checkpointing in the literature, such as CryoPID [55], Distributed MultiThreaded CheckPointing (DMTCP) [16], ComPiler for Portable Checkpointing (CPPC) [163] or OpenVZ [153]. Berkeley Lab Checkpoint/Restart (BLCR) [65], a hybrid kernel/user implementation that provides a robust, production quality implementation that checkpoints applications without requiring changes in the their code, is currently one of the most used tools of checkpointing.…”
Section: Summary Of the State Of Artmentioning
confidence: 99%
“…User space checkpointing packages are highly 82 portable and can typically be compiled and run on any modern Unix. Examples of tools that implement this type of approach are CryoPID [55], that allows the user to capture the state of a running process in Linux and save it to a file, DMTCP [16], a tool to transparently checkpoint the state of multi-threaded and distributed applications, or CPPC [163], a checkpointing tool focused on the insertion of fault tolerance into long-running message-passing applications. HTCondor [91] offers user-level checkpointing and other tools for batch job scheduling.…”
Section: Application Migrationmentioning
confidence: 99%