2015
DOI: 10.1142/s0129626415410030
|View full text |Cite
|
Sign up to set email alerts
|

Online Task Resource Consumption Prediction for Scientific Workflows

Abstract: Estimates of task runtime, disk space usage, and memory consumption, are commonly used by scheduling and resource provisioning algorithms to support efficient and reliable workflow executions. Such algorithms often assume that accurate estimates are available, but such estimates are difficult to generate in practice. In this work, we first profile five real scientific workflows, collecting fine-grained information such as process I/O, runtime, memory usage, and CPU utilization. We then propose a method to auto… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
38
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
3
3
1

Relationship

2
5

Authors

Journals

citations
Cited by 51 publications
(38 citation statements)
references
References 46 publications
0
38
0
Order By: Relevance
“…Typically, mean values yield high values of standard deviation (due to variations inherent to the application itself, or the system including external load), thus estimations may not be accurate. Task characteristics estimation is beyond the scope of this work, and sophisticated methods to provide accurate estimates can be found in [22,23,24,25]. However, this work intends to demonstrate that even using inaccurate estimation methods, our proposed process can cope with the poor estimates and still yield good results.…”
Section: Decision Agentmentioning
confidence: 99%
See 3 more Smart Citations
“…Typically, mean values yield high values of standard deviation (due to variations inherent to the application itself, or the system including external load), thus estimations may not be accurate. Task characteristics estimation is beyond the scope of this work, and sophisticated methods to provide accurate estimates can be found in [22,23,24,25]. However, this work intends to demonstrate that even using inaccurate estimation methods, our proposed process can cope with the poor estimates and still yield good results.…”
Section: Decision Agentmentioning
confidence: 99%
“…Although several works address task requirement estimations based on provenance data [22,23,24,25], accurate estimations are still challenging, and may be specific to a certain type of application. In [26], a prediction algorithm based on machine learning (Naïve Bayes classifier) is proposed to identify faults before they occur, and to apply preventive actions to mitigate the faults.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…To the best of our knowledge, this is the first work that predicts the runtime of workflow tasks using an online incremental learning approach. Hence, to compare our work with existing state-of-the-art solutions of task runtime prediction, we reproduce the batch offline learning work by da Silva et al [9] that makes use of a task's input data as a feature to predict the task runtime. We refer to this approach as the baseline scenario.…”
Section: Performance Evaluationmentioning
confidence: 99%