2015 IEEE 17th International Conference on High Performance Computing and Communications, 2015 IEEE 7th International Symposium 2015
DOI: 10.1109/hpcc-css-icess.2015.246
|View full text |Cite
|
Sign up to set email alerts
|

Performance Prediction for Apache Spark Platform

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
76
0
1

Year Published

2016
2016
2024
2024

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 126 publications
(77 citation statements)
references
References 10 publications
0
76
0
1
Order By: Relevance
“…The performance of the shared‐memory computation programs can be predicted with the Tanzil et al model, whereas in those programs with remote direct memory access, the Wasi‐ur‐Rahman et al model can be used. Apache Spark programs process the data using distributed memory abstraction, and their performance can be predicted by a model that executes a sample of data …”
Section: Resultsmentioning
confidence: 99%
“…The performance of the shared‐memory computation programs can be predicted with the Tanzil et al model, whereas in those programs with remote direct memory access, the Wasi‐ur‐Rahman et al model can be used. Apache Spark programs process the data using distributed memory abstraction, and their performance can be predicted by a model that executes a sample of data …”
Section: Resultsmentioning
confidence: 99%
“…Tuning up its performance is an important concern of the community and yet there is not much related work. In [30], the authors present, to the best of our knowledge, the only Apache Spark prediction model. Again sampling the application with a smaller data size is used to get statistics about the duration of the tasks and plugged into a formula that gives an estimation of the total run time for a different file size.…”
Section: Related Workmentioning
confidence: 99%
“…Other works compare Apache Spark with other frameworks such as MapReduce [72], study the performance of Apache Spark for specific scenarios such as scale-up configuration [10], analyze the performance of Spark's programming model for large-scale data analytics [78] and identify the performance bottlenecks in Apache Spark [66] [11]. In addition, as Apache Spark offers language-integrated APIs, there are some efforts to provide the APIs in other languages.…”
Section: Related Researchmentioning
confidence: 99%