Proceedings of the 18th International Conference on Distributed Computing and Networking 2017
DOI: 10.1145/3007748.3007768
|View full text |Cite
|
Sign up to set email alerts
|

Topology-aware resource management for HPC applications

Abstract: The Resource and Job Management System (RJMS) is a crucial system software part of the HPC stack. It is responsible for eciently delivering computing power to applications in supercomputing environments. Its main intelligence relies on resource selection techniques to nd the most adapted resources to schedule the users' jobs. Improper resource selection operations may lead to poor performance executions and global system utilization along with increase of system fragmentation and jobs starvation. These phenome… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2018
2018
2020
2020

Publication Types

Select...
5

Relationship

1
4

Authors

Journals

citations
Cited by 13 publications
(10 citation statements)
references
References 23 publications
0
10
0
Order By: Relevance
“…Solution ID Processors Memories 0 1 4 3 3 3 3 3 3 3 3 0 3 0 0 0 1 4 1 3 3 3 3 3 3 3 3 1 6 3 3 3 to determine the best choice among the available nodes based upon their position within the network [13], or emphasizing various targets, such as power-awareness [22] or resilience-awareness [5]. More recently, resource management has focused on the specific type of applications, such as MapReduce-based applications.…”
Section: Ldpc Ifs Solution Id Processors Memoriesmentioning
confidence: 99%
“…Solution ID Processors Memories 0 1 4 3 3 3 3 3 3 3 3 0 3 0 0 0 1 4 1 3 3 3 3 3 3 3 3 1 6 3 3 3 to determine the best choice among the available nodes based upon their position within the network [13], or emphasizing various targets, such as power-awareness [22] or resilience-awareness [5]. More recently, resource management has focused on the specific type of applications, such as MapReduce-based applications.…”
Section: Ldpc Ifs Solution Id Processors Memoriesmentioning
confidence: 99%
“…Some steps have been taken towards integrating more knowledge about the communication patterns of applications into batch schedulers. For instance, Georgiou et al studied the integration of TreeMatch into SLURM [9]. Given the communication matrix of an application, the scheduler minimizes the load of the network links by smartly mapping the application's processes on the resources.…”
Section: Related Workmentioning
confidence: 99%
“…Some steps have been taken towards integrating more knowledge about the communication patterns of applications into the batch scheduler. For example, Georgiou et al studied the integration of TreeMatch into SLURM [19]. Given the communication matrix of an application, the scheduler minimizes the load of the network links by smartly mapping the application's processes on the resources.…”
Section: Related Workmentioning
confidence: 99%