2022
DOI: 10.1109/tcc.2019.2947674
|View full text |Cite
|
Sign up to set email alerts
|

Optimizing Speculative Execution in Spark Heterogeneous Environments

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 11 publications
(3 citation statements)
references
References 28 publications
0
3
0
Order By: Relevance
“…A considerable portion of both analytical and learningbased approaches have reported the employment of sampling and micro-benchmarking. In several researches [20], [39], [40], [41], linear regression of selected sample executions are considered as the predictor for the actual-size performance of a Hadoop application. Based on a similar sampling approach, more sophisticated learning techniques have been adopted such as deep reinforcement learning [12] or combining multiple regression models each for a single stage of the whole application [13].…”
Section: Related Workmentioning
confidence: 99%
“…A considerable portion of both analytical and learningbased approaches have reported the employment of sampling and micro-benchmarking. In several researches [20], [39], [40], [41], linear regression of selected sample executions are considered as the predictor for the actual-size performance of a Hadoop application. Based on a similar sampling approach, more sophisticated learning techniques have been adopted such as deep reinforcement learning [12] or combining multiple regression models each for a single stage of the whole application [13].…”
Section: Related Workmentioning
confidence: 99%
“…Since every node's capability may vary, it is essential to have an appropriate metric to measure the performance of heterogeneity nodes. Therefore, the capability of a node can be obtained through the amount of tasks completed and total tasks processed as in ( 15) [33]:…”
Section: ) Backup Straggler Task On Proper Nodementioning
confidence: 99%
“…In this section, the performance of the proposed framework is assessed on a spark cluster with a diverse set of nodes. Also, the proposed framework is compared with Spark-Default, Spark-Speculation and the work in [33] (marked as Spark-ETWR) in various benchmarks at different input sizes. The performance is evaluated in terms of the job execution time that refers to the elapsed time from the beginning to the end of the job in seconds.…”
Section: Performance Evaluationmentioning
confidence: 99%