2013
DOI: 10.1007/978-3-642-35867-8_14
|View full text |Cite
|
Sign up to set email alerts
|

Comprehensive Workload Analysis and Modeling of a Petascale Supercomputer

Abstract: Abstract. The performance of supercomputer schedulers is greatly affected by the characteristics of the workload it serves. A good understanding of workload characteristics is always important to develop and evaluate different scheduling strategies for an HPC system. In this paper, we present a comprehensive analysis of the workload characteristics of Kraken, the world's fastest academic supercomputer and 11th on the latest Top500 list, with 112,896 compute cores and peak performance of 1.17 petaflops. In this… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
12
0
1

Year Published

2015
2015
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 21 publications
(13 citation statements)
references
References 16 publications
0
12
0
1
Order By: Relevance
“…Two-level scheduling models assume that the tasks run on them are largely short-lived and relinquish resource frequently, a valid assumption for data-intensive workloads [21], [22]. However, because HPC workloads are dominated by compute-bound and long-running applications [1], [23], two-level scheduling models are not applicable for HPC nodes.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Two-level scheduling models assume that the tasks run on them are largely short-lived and relinquish resource frequently, a valid assumption for data-intensive workloads [21], [22]. However, because HPC workloads are dominated by compute-bound and long-running applications [1], [23], two-level scheduling models are not applicable for HPC nodes.…”
Section: Related Workmentioning
confidence: 99%
“…Our simulators aim to address two main applications: data-intensive applications and traditional MPI applications. There are many approaches to partition workloads among clusters [1], [23]. To simplify and obtain the critical behavior characteristics of different scheduling models, we select a simple two-way split on top of our simulators between I/O-bound jobs that provide end-user operations (e.g., web services) and internal infrastructure services (e.g., BigTable), and compute-bound jobs that perform a computation.…”
Section: Workloadsmentioning
confidence: 99%
“…Since the dataintensive applications will converge with HPC applications in the future, our simulators aim at two main applications: data-intensive ones and traditional MPI ones [48]. There are many approaches to partitioning workloads among clusters [42,43,52,53]. To simplify and obtain the critical behavior characteristics of different schedulers, we select a simple twoway split on top of our simulators between I/O bound jobs that provide end-user operations (e.g., web services), internal infrastructure services (e.g., BigTable), and compute-bound jobs that perform a computation.…”
Section: Parameters For Simulation the Scheduler Decision Time (mentioning
confidence: 99%
“…As discussed [52,53], we assume that all tasks of jobs from HPC workloads are CPU-intensive and the tasks in a one-task job are sequential. For data-intensive jobs [43], we assume that a job is made up of one or more tasks (occasionally thousands of tasks), which are mostly CPU-intensive (> 80%) but with a fast turnaround, whereas the rest are I/O-intensive (< 20%) and consume the majority of resources.…”
Section: Workloads Workload Heterogeneitymentioning
confidence: 99%
“…This study not only helps to understand the behavior of current HPC systems but can also be used to develop new scheduling strategies for the forthcoming exascale systems. Similar studies mixing the characterization, modeling, and prediction of HPC workloads have been proposed in [5,17]. In [8], the authors proposed a complete reconguration of their batch scheduling systems, including the denition of the scheduling queues, based on a thorough analysis of the workload characteristics and detailed simulations.…”
Section: Introductionmentioning
confidence: 96%