2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS) 2019
DOI: 10.1109/ipdps.2019.00077
|View full text |Cite
|
Sign up to set email alerts
|

Adapting Batch Scheduling to Workload Characteristics: What Can We Expect From Online Learning?

Abstract: Despite the impressive growth and size of super-computers, the computational power they provide still cannot match the demand. Efficient and fair resource allocation is a critical task. Super-computers use Resource and Job Management Systems to schedule applications, which is generally done by relying on generic index policies such as First Come First Served and Shortest Processing time First in combination with Backfilling strategies. Unfortunately, such generic policies often fail to exploit specific charact… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
10
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 15 publications
(17 citation statements)
references
References 20 publications
1
10
0
Order By: Relevance
“…WFP has cumulative values close to SPF and SAF, while FCFS has the worst values by a large margin in all cases. These results are consistent with previous comparisons of scheduling policies [14], [15].…”
Section: Overall Impact On Scheduling Performancesupporting
confidence: 93%
See 4 more Smart Citations
“…WFP has cumulative values close to SPF and SAF, while FCFS has the worst values by a large margin in all cases. These results are consistent with previous comparisons of scheduling policies [14], [15].…”
Section: Overall Impact On Scheduling Performancesupporting
confidence: 93%
“…Finally, they argue that training can take many months (or years) before it reaches a stable level when using a few features, which would prevent practical deployments. We also observed strong day-to-day performance variability [15] and the potential inefficiency of static policies learned from long past periods of time. These observations motivate the need for reactive online learning policy that can quickly adapt to rapid load variations.…”
Section: Related Workmentioning
confidence: 83%
See 3 more Smart Citations