2020 IEEE 40th International Conference on Distributed Computing Systems (ICDCS) 2020
DOI: 10.1109/icdcs47774.2020.00047
|View full text |Cite
|
Sign up to set email alerts
|

Lynceus: Cost-efficient Tuning and Provisioning of Data Analytic Jobs

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
16
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
2
2

Relationship

1
7

Authors

Journals

citations
Cited by 23 publications
(16 citation statements)
references
References 27 publications
0
16
0
Order By: Relevance
“…Several research works target automatic cloud configuration optimization. Cherrypick [6], Arrow [27], Scout [28], Micky [26], Vanir [12] and Lynceus [16] perform cloud configuration optimization for distributed data analytics frameworks such as Spark and Hadoop. Ernest [41] creates an analytical model for Spark applications and uses that to optimize cloud configurations.…”
Section: Related Workmentioning
confidence: 99%
“…Several research works target automatic cloud configuration optimization. Cherrypick [6], Arrow [27], Scout [28], Micky [26], Vanir [12] and Lynceus [16] perform cloud configuration optimization for distributed data analytics frameworks such as Spark and Hadoop. Ernest [41] creates an analytical model for Spark applications and uses that to optimize cloud configurations.…”
Section: Related Workmentioning
confidence: 99%
“…load balancing, increase of output quality). We should note here the importance of cost models [3,9] for the evaluation of alternative plans, which play the role of a Knowledge Base and are continuously augmented with historical data from monitoring and reaction to past events.…”
Section: Aiops For Elastic Cloud To Edge Intelligencementioning
confidence: 99%
“…The problem is further exacerbated by the fact that the (distributed) training process of ML jobs exposes several hyper-parameters -such as the batch size considered in each training iteration or the frequency of synchronization among workers. The optimal configuration for these parameters can be substantially affected by the choice of the type and number of provisioned cloud resources [3].…”
Section: Introductionmentioning
confidence: 99%
“…Given the complexity of modelling the dynamics of modern ML and cloud platforms via white-box methods, a common approach in the literature is to rely on black-box modelling and Bayesian Optimization (BO) techniques [3]- [5]. These techniques have the key advantage of requiring no prior knowledge of the target ML model to be optimized, and as such require the target ML model to be deployed and trained in several (cloud/hyper-parameter) configurations.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation