2016 18th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC) 2016
DOI: 10.1109/synasc.2016.072
|View full text |Cite
|
Sign up to set email alerts
|

A Combined Analytical Modeling Machine Learning Approach for Performance Prediction of MapReduce Jobs in Cloud Environment

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
24
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
6
2

Relationship

3
5

Authors

Journals

citations
Cited by 20 publications
(24 citation statements)
references
References 27 publications
0
24
0
Order By: Relevance
“…What is worthwhile to note is that the previous formula represents a general relation describing either closed form results, as those presented in [15], based on ML [28], or the average execution times achieved via simulation: in this paper both the latter approaches are adopted.…”
Section: Problem Statementmentioning
confidence: 99%
See 2 more Smart Citations
“…What is worthwhile to note is that the previous formula represents a general relation describing either closed form results, as those presented in [15], based on ML [28], or the average execution times achieved via simulation: in this paper both the latter approaches are adopted.…”
Section: Problem Statementmentioning
confidence: 99%
“…Specifically, it is the result of a ML process to get a first order approximation of the execution time of Hadoop and Spark jobs in Cloud clusters. More in details, building upon [30], which compares linear regression, Gaussian support vector regression (SVR), polynomial SVR with degree ranging between 2 and 6, and linear SVR, this paper adopts a model learned with linear SVR, following [28]. This is due to the fact that SVR with other kinds of kernel fares worse than with the linear one, whilst plain linear regression requires specific data cleaning to avoid perfect multicollinearity in the design matrix, thus making it harder to apply in the greatest generality.…”
Section: Identifying An Initial Solutionmentioning
confidence: 99%
See 1 more Smart Citation
“…The other values set for the input parameters of the proposed model are presented in Table 4. Most of the values used herein as input parameters of the proposed SAN model are in the range of the values considered in other related work [10,11,15,22,29,36,37,39,42]. In order to compare different situations, the proposed model was evaluated in three modes: (1) pure power saving, (2) pure rejuvenation, and (3) with simultaneous application of rejuvenation and power saving, called power-aware rejuvenation.…”
Section: Performance Evaluationmentioning
confidence: 99%
“…The latter necessitates research on teletraffic loss or queueing models, either at call-level or at packet-level [2][3][4][5][6][7]. Such models not only assist in network optimization and dimensioning procedures but they may also be used in combination with machine learning techniques [8,9] or as an input to computational intelligent techniques, such as the fuzzy analytical hierarchy process techniques [10,11]. In this paper, we concentrate on call-level teletraffic loss models.…”
Section: Introductionmentioning
confidence: 99%