2018 IEEE/ACM 11th International Conference on Utility and Cloud Computing (UCC) 2018
DOI: 10.1109/ucc.2018.00018
|View full text |Cite
|
Sign up to set email alerts
|

Task Runtime Prediction in Scientific Workflows Using an Online Incremental Learning Approach

Abstract: Many algorithms in workflow scheduling and resource provisioning rely on the performance estimation of tasks to produce a scheduling plan. A profiler that is capable of modeling the execution of tasks and predicting their runtime accurately, therefore, becomes an essential part of any Workflow Management System (WMS). With the emergence of multitenant Workflow as a Service (WaaS) platforms that use clouds for deploying scientific workflows, task runtime prediction becomes more challenging because it requires t… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
28
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 38 publications
(28 citation statements)
references
References 26 publications
0
28
0
Order By: Relevance
“…The job length S is determined by the application-level workloads and the requesting service (e.g., VM types) [29]- [31]. Although S cannot be obtained precisely before job completion, it can be predicted via estimation technologies [31]- [35]. Particularly, the prediction method using machine learning in reference [32] has achieved a best-case estimation error of 1.6%.…”
Section: Traffic Modelmentioning
confidence: 99%
“…The job length S is determined by the application-level workloads and the requesting service (e.g., VM types) [29]- [31]. Although S cannot be obtained precisely before job completion, it can be predicted via estimation technologies [31]- [35]. Particularly, the prediction method using machine learning in reference [32] has achieved a best-case estimation error of 1.6%.…”
Section: Traffic Modelmentioning
confidence: 99%
“…e latest work by Sahoo et al (Sahoo et al 2019) a empts to make OMKR, an online and incremental machine learning approach, scalable to large time-series datasets in a near real-time process. While this approach is still intensively being studied for scienti c work ow area, the preliminary work on such an approach has been presented by Hilman et al (Hilman et al 2018) for future work ow as a service platform.…”
Section: Fast and Reliable Task Runtime Estimation In Near Real-time mentioning
confidence: 99%
“…Table 1 summarizes major similarities and differences between our work and the existing studies. First, the examined papers concern various application domains: load sharing facility (LSF) [8], parallel program [9], [11], [25], cloud [10], [29], HPC [2], [14], [15], [19], location-based services [20]- [23], databases [26]- [28], big data applications [29], [30], and scientific workloads [12], [13], [16]- [19]. The runtime estimation problem addressed in this paper applies to the scientific workloads domain.…”
Section: Related Workmentioning
confidence: 99%
“…Many of the existing studies [20]- [24], [28] build neural-net based learning models and utilize them for deriving estimated time; many others [2], [10], [16]- [19], [25], [30] use tree and linear regression based machine learning models. Some other works [11], [12], [14], [15] use hybrid methods combining these tools-analytical model, machine learning, and deep learning. In this article, we use what we consider the two most relevant of these works [16], [17] for performance comparison.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation