2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS) 2020
DOI: 10.1109/ipdps47924.2020.00087
|View full text |Cite
|
Sign up to set email alerts
|

What does Power Consumption Behavior of HPC Jobs Reveal? : Demystifying, Quantifying, and Predicting Power Consumption Characteristics

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 25 publications
(5 citation statements)
references
References 55 publications
0
5
0
Order By: Relevance
“…Patel et al [5] explore the power characteristics of typical HPC jobs during the approach to the Exascale era. As HPC systems become increasingly power constrained, they show that a data-driven approach to HPC application power characteristics can be used to make more effective use of HPC systems.…”
Section: Related Workmentioning
confidence: 99%
“…Patel et al [5] explore the power characteristics of typical HPC jobs during the approach to the Exascale era. As HPC systems become increasingly power constrained, they show that a data-driven approach to HPC application power characteristics can be used to make more effective use of HPC systems.…”
Section: Related Workmentioning
confidence: 99%
“…However, datacenters can expose thousands of signals, so even our broad selection imposes a bias. Finding a complete and general, holistic method of analysis is beyond the scope of this work-a goal which we envision for the entire community, for the next decade, which already includes awardwinning work that focuses on selecting meaningful signals [69] and large-scale data collection [62,44,40]. Furthermore, the method proposed here can be contrasted with methods from the other end of the holistic-reductionist spectrum; compared with focused work on even one of the questions we address, our method cannot produce the same depth for the same effort.…”
Section: Known Limitationsmentioning
confidence: 99%
“…From the relatively few traces that are shared publicly, many are focused on important but specific kinds of workloads, such as tightly-coupled parallel jobs [18], bags of tasks [26], and workflows [65]. Other datasets only include a limited subset of metrics such as power consumption [44], or high-level job information [43]. Only a handful of datasets include low-level server metrics, such as the Microsoft Azure serverless traces [49] or the Solvinity business-critical traces [50].…”
Section: Introductionmentioning
confidence: 99%
“…As deep learning (DL) workflows become more prevalent in the sciences, neural architecture search and hyperparameter sweeps consume an increasingly enormous * ncfrey@mit.edu amount of compute and power resources [3,5,27,28,39,44,48] at high-performance computing (HPC) centers [2,41,46] and cloud providers. While the cost per training step has decreased for deep neural networks (DNNs) due to optimized hardware and backend optimizations, overall costs have increased and training large models can reach into the millions of dollars [39].…”
Section: Introductionmentioning
confidence: 99%