Proceedings of the 5th Workshop on Fault Tolerance for HPC at eXtreme Scale 2015
DOI: 10.1145/2751504.2751511
|View full text |Cite
|
Sign up to set email alerts
|

LogDiver

Abstract: This paper presents LogDiver, a tool for the analysis of application-level resiliency in extreme-scale computing systems. The tool has been implemented to handle data generated by system monitoring tools in Blue Waters, the petascale machine in production at the University of Illinois' National Center for Supercomputing Applications. The tool is able: i) to filter, extract, and classify error data from different sources of information, such as system logs, hardware sensors and workload logs; ii) to extract sig… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 17 publications
references
References 17 publications
0
0
0
Order By: Relevance