Proceedings of the 50th Annual Southeast Regional Conference 2012
DOI: 10.1145/2184512.2184574

Application monitoring and checkpointing in HPC

Cited by 20 publications (12 citation statements)
References 13 publications
“…Exascale requires new monitoring techniques, such as sub-optimal period scheduling [13] and strong usage of HPC system statistics [14] to improve system utilization. Thus, the effort on monitors development has been continuous in HPC systems.…”
Section: Related Work (mentioning)
confidence: 99%
“…Under such circumstances, a decoupled storage system (e.g. a parallel file system such as GPFS [3]) does not provide sufficient I/O bandwidth to handle the explosion of data sizes: for example, Jones et al [4] predict dump times in the order of several hours.…”
Section: Introduction (mentioning)
confidence: 99%
“…Under such circumstances, a decoupled storage system (e.g. a parallel file system such as GPFS [3] or a specialized storage system such as BlobSeer [4]) does not provide sufficient I/O bandwidth to handle the explosion of data sizes: for example, Jones et al [5] predict dump times in the order of several hours.…”
Section: Introduction (mentioning)
confidence: 99%