2021
DOI: 10.1109/tbdata.2019.2908188
|View full text |Cite
|
Sign up to set email alerts
|

Deadline-Aware Cost Optimization for Spark

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
15
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 12 publications
(15 citation statements)
references
References 26 publications
0
15
0
Order By: Relevance
“…As it can be observed via Table 4, several works predict execution time [13]- [15], [17], [31], [33], [63], while others estimate memory consumption [16], [18], [29]. It can be observed that in [13]- [18], [63], the prediction is done for big data workloads, whereas in [29], [31], [33], the considered applications are routine ones. In [14], a performance prediction framework Ernest is proposed, which can predict the execution time on a hardware configuration, given a job and its input.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…As it can be observed via Table 4, several works predict execution time [13]- [15], [17], [31], [33], [63], while others estimate memory consumption [16], [18], [29]. It can be observed that in [13]- [18], [63], the prediction is done for big data workloads, whereas in [29], [31], [33], the considered applications are routine ones. In [14], a performance prediction framework Ernest is proposed, which can predict the execution time on a hardware configuration, given a job and its input.…”
Section: Related Workmentioning
confidence: 99%
“…In [13], a machine learning approach is proposed for predicting the execution time of Spark applications. Similarly, an empirical model is proposed in [63], for estimating the completion time of the Spark job on a cloud, with respect to cluster nodes count, input data size, and the number of iterations. The model predicts the cost optimal cluster configuration for executing the Spark job on a cloud under the SLO (Service Level Objective) specified completion deadline.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…To optimize T DP , modifications were applied to the Apache Spark Streaming process to satisfy the real-time delay bounds even when errors occur in the communication and data computing process. The SRSAF scheme proposed in this paper is different from the OptEx [25] and Spark adaptive failure-compensation (SAF) [26] schemes because of how it was designed to cope with errors in the communication and data computing process while satisfying the real-time delay bounds that support control of the UAVs in the HAP and LAP as well as the network data services.…”
Section: Optimization Of Uav Control Communication and Data Processingmentioning
confidence: 99%