2015 IEEE 40th Local Computer Networks Conference Workshops (LCN Workshops) 2015
DOI: 10.1109/lcnw.2015.7365918
|View full text |Cite
|
Sign up to set email alerts
|

Spark on entropy: A reliable & efficient scheduler for low-latency parallel jobs in heterogeneous cloud

Abstract: Abstract-In heterogeneous cloud, the provision of quality of service (QoS) guarantees for on-line parallel analysis jobs is much more challenging than off-line ones, mainly due to the many involved parameters, unstable resource performance, various job pattern and dynamic query workload. In this paper we propose an entropy-based scheduling strategy for running the on-line parallel analysis as a service more reliable and efficient, and implement the proposed idea in Spark.Entropy, as a measure of the degree of … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
7
1

Relationship

2
6

Authors

Journals

citations
Cited by 14 publications
(6 citation statements)
references
References 12 publications
0
6
0
Order By: Relevance
“…As shown through Formulas 1 and 2, these were then calculated, where j ∈ [0.1, 0.2, 0.3, 0.4, 0.5] represents the degree of complexity and i ∈ [5,6,7,8,9,10,11,12,13,14,15] represents the number of VMs.…”
Section: B Experiments Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…As shown through Formulas 1 and 2, these were then calculated, where j ∈ [0.1, 0.2, 0.3, 0.4, 0.5] represents the degree of complexity and i ∈ [5,6,7,8,9,10,11,12,13,14,15] represents the number of VMs.…”
Section: B Experiments Resultsmentioning
confidence: 99%
“…The reported scientific advances in both software platform development and Cloud Computing that enable fast data processing in the cloud is certainly a good news. Successful deployment of analysis engines such as Hive, Dremel, MapReduce, Spar and Impala has helped to run analysis jobs in short time across thousands of cloud resources [11]. However, adaptively scheduling groups of tasks based on dynamic changes in resource performance has been a challenge and remains unsolved.…”
Section: Introductionmentioning
confidence: 99%
“…In this section, we will compare TOPE with the existing approach FIFO, SpreadOut and Non-SpreadOut [26], Fair Scheduler [27] and Naïve Bayes Scheduler [28] through the following aspects: bandwidth resource cost, load balancing effect and energy consumption.…”
Section: Methodsmentioning
confidence: 99%
“…if the episode ends at the state s j+1 ; r j + γ max a Q(s , , a , , ω , ), otherwise. (27) where Q(s , a', ω ) is the Q-value of the target network. The parameters of the convolutional neural network are updated according to the gradient descent method.…”
Section: B Virtual Link Mapping Schemementioning
confidence: 99%
“…To evaluate the spread of the damages, we define damage D average (Difference of average workflow runtime R average ) and D std (Difference of workflow runtime Standard Deviation R std ) between two simulations results, which are calculated as shown in Formula 2 and 3, where i ∈ [5,6,7,8,9,10,11,12,13,14,15] As we can see from Figure 5, for number of VMs i < 10, the changes of D average for different degrees of complexity is relatively small, in this region, the damage is not spread and initial damage stays small.…”
Section: Order and Chaos In Complex Scheduling Systemmentioning
confidence: 99%