2014 First International Workshop on HPC User Support Tools 2014
DOI: 10.1109/hust.2014.7

Comprehensive Resource Use Monitoring for HPC Systems with TACC Stats

Cited by 56 publications
(24 citation statements)
References 3 publications
“…The TACC Stats [19] project also shares many of our goals. TACC Stats is used by admins for understanding, but the data is aggregated nightly and hence cannot be used for run-time and immediate post-run understanding.…”
Section: Related Work (mentioning)
confidence: 98%
“…This cluster consists of 1888 computing nodes from which the records of 1709 nodes are used in this study. The resource usage data is collected using the TACC Stats system monitor [10] which records various resources usage statistics at each computational node every 10 minutes. In our experiments, we use a set of 86 resource usage statistics with a resolution of 10 minutes from 1:10:01 March 1st 2013 to 23:40:01 March 7th 2013.…”
Section: Dataset (mentioning)
confidence: 99%
“…Several tools exist for collecting and visualizing resource usage data from large scale HPC installations (e.g. Texas Advanced Computing Center TACC Stats [10], XSEDE Metrics on Demand or XDMoD [17], etc.). Such tools can produce large amounts of high dimensional resource usage data at a high temporal frequency for each computational node in the system.…”
Section: Introduction (mentioning)
confidence: 99%
“…Many of these solutions are site- or vendor-specific and are thus not easy to deploy at other sites. A solution that also targets small- to medium-sized clusters is TACC Stats [6], which is also used as part of the larger XDMoD project [7]. Recent and current efforts include the FEPA project [8], from which also the approach presented in this paper originates, and the just-started ProfiT-HPC [9].…”
Section: Related Work (mentioning)
confidence: 99%