DJSB: Dynamic Job Scheduling Benchmark

Proceedings of the 48th International Conference on Parallel Processing

Jokanović

Corbalan

2019

Self Cite

In job scheduling, the concept of malleability has been explored for many years. Research shows that malleability improves system performance, but its use in HPC never became widespread. The causes are the difficulty of developing malleable applications and the lack of support and integration across the different layers of the HPC software stack. However, in recent years, malleability in job scheduling has become more critical because of the increasing complexity of hardware and workloads. In this context, using nodes in exclusive mode is not always the most efficient solution, as it was for traditional HPC jobs, where applications were highly tuned for static allocations but offered zero flexibility for dynamic execution. This paper proposes a new holistic, dynamic job scheduling policy, Slowdown Driven (SD-Policy), which exploits the malleability of applications as the key technology to reduce the average slowdown and response time of jobs. SD-Policy is based on backfill and node sharing. It applies malleability to running jobs to make room for jobs that will run with a reduced set of resources, only when the estimated slowdown improves over the static approach. We implemented SD-Policy in SLURM and evaluated it in a real production environment and with a simulator, using workloads of up to 198K jobs. Results show better resource utilization, with reductions in makespan, response time, slowdown, and energy consumption of up to 7%, 50%, 70%, and 6%, respectively, for the evaluated workloads.

CCS CONCEPTS: • Software and its engineering → Scheduling; • Computer systems organization → Multicore architectures.
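The decision rule described in the abstract above (start a job on reduced resources only when its estimated slowdown beats the static alternative) can be illustrated with a minimal sketch. This is not the paper's implementation: the abstract does not give the estimator, so the function names, parameters, and the choice to consider only the incoming job are assumptions made for illustration. It uses the common definition of slowdown as response time divided by run time.

```python
def slowdown(wait_time: float, run_time: float, reference_run_time: float) -> float:
    """Slowdown = (wait + run) / reference run time (common definition)."""
    return (wait_time + run_time) / reference_run_time


def should_start_shrunk(
    wait_so_far: float,         # seconds the job has already queued (hypothetical input)
    static_start_delay: float,  # estimated extra wait for a full static allocation
    static_run_time: float,     # estimated run time on the full allocation
    shrunk_run_time: float,     # estimated run time on the reduced allocation
) -> bool:
    """Return True if starting now on fewer resources gives a lower
    estimated slowdown than waiting for the static allocation.

    Assumption: both options are normalized by the static run time, and
    the impact on the jobs that are shrunk to make room is ignored.
    """
    static = slowdown(wait_so_far + static_start_delay, static_run_time, static_run_time)
    malleable = slowdown(wait_so_far, shrunk_run_time, static_run_time)
    return malleable < static


if __name__ == "__main__":
    # Long queue delay: running 50% slower right away still wins.
    print(should_start_shrunk(0, 3600, 1000, 1500))
    # Short queue delay: waiting for the full allocation is better.
    print(should_start_shrunk(0, 100, 1000, 1500))
```

A fuller model would also account for the slowdown imposed on the running jobs that give up resources, which this sketch deliberately leaves out.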

Section: Related Work (mentioning)

confidence: 99%

Holistic Slowdown Driven Scheduling and Resource Management for Malleable Jobs

Proceedings of the 48th International Conference on Parallel Processing

Jokanović

Corbalan

2019

Self Cite

“…A similar approach was presented by [14], based on dynamically changing the operating system CPUSETs for MPI processes, but in this case, there was no integration with the programming model. This approach is equivalent to oversubscription of resources, i.e., more than one process running in the same core, which in general has a negative impact on the applications' performance, as demonstrated in [26]. In our integration, we used OpenMP/OmpSs programming models to adapt the number of threads to the change in the number of computing resources.…”

Section: Related Work (mentioning)

confidence: 99%

Drom

Proceedings of the 47th International Conference on Parallel Processing Companion

García-Gasulla

López

et al. 2018

Self Cite

In the design of future HPC systems, research in resource management is showing an increasing interest in more dynamic control of the available resources. It has been proven that enabling jobs to change their number of computing resources at run time, i.e. their malleability, can significantly improve HPC system performance. However, job schedulers and applications typically do not support malleability due to the common belief that it introduces additional programming complexity and performance impact. This paper presents DROM, an interface that provides efficient malleability with no effort for program developers. The running application is enabled to adapt the number of threads to the number of assigned computing resources in a way that is completely transparent to the user, through the integration of DROM with standard programming models such as OpenMP/OmpSs and MPI. We designed the APIs to be easily used by any programming model, application, and job scheduler or resource manager. Our experimental results from the analysis of two realistic use cases, based on malleability by reducing the number of cores a job uses per node and on job co-allocation, show the potential of DROM for improving the performance of HPC systems. In particular, a workload of two MPI+OpenMP neuro-simulators is tested, reporting improvements in system metrics such as total run time and average response time of up to 8% and 48%, respectively.

CCS CONCEPTS: • Software and its engineering → Software libraries and repositories.

“…methodologies for evaluating the efficiency and effectiveness of a job scheduler which we can separate in benchmarks and simulations. Benchmarks assume a real run of workloads in a cluster, with the purpose of evaluating well-known system metrics [10] or specific aspects of the system that administrator needs to optimize [11], such as the effect of dynamic job scheduling in the context of malleable jobs. However, it is not always possible to stop a production-machine to perform this type of evaluation, so usually, simulations are more convenient and practical to perform.…”

Section: Related Work (mentioning)

confidence: 99%

Evaluating SLURM Simulator with Real-Machine SLURM and Vice Versa

Jokanović

2018 IEEE/ACM Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS)

Corbalan

2018

Self Cite

Having a precise and fast job scheduler model that resembles the behavior of real-machine job scheduling software is extremely important in the field of job scheduling. The idea behind the SLURM simulator is to preserve the original code of the core SLURM functions while allowing for all the advantages of a simulator. Since 2011, the SLURM simulator has passed through several iterations of improvements in different research centers. In this work, we present our latest improvements to the SLURM simulator and perform the first-ever validation of the simulator against the real machine. In particular, we improved the simulator's performance by about 2.6 times, made the simulator deterministic across repeated runs with the same setup, and improved the simulator's accuracy: its deviation from the real machine is lowered from the previous 12% to at most 1.7%. Finally, we illustrate with several use cases the value of the simulator for job scheduling researchers, SLURM system administrators, and SLURM developers.