2018 IEEE 34th International Conference on Data Engineering (ICDE) 2018
DOI: 10.1109/icde.2018.00156
|View full text |Cite
|
Sign up to set email alerts
|

Query and Resource Optimization: Bridging the Gap

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
7
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
1
1

Relationship

2
5

Authors

Journals

citations
Cited by 17 publications
(7 citation statements)
references
References 7 publications
0
7
0
Order By: Relevance
“…The degree of parallelism (i.e., the number of machines or containers allocated for each operator) is a key factor in determining the runtime of queries in massively parallel databases [46], which implicitly depends on the partition count. This makes partition count as an important feature in determining the cost of an operator (as noted in Figures 5-6).…”
Section: Resource-aware Query Planningmentioning
confidence: 99%
See 2 more Smart Citations
“…The degree of parallelism (i.e., the number of machines or containers allocated for each operator) is a key factor in determining the runtime of queries in massively parallel databases [46], which implicitly depends on the partition count. This makes partition count as an important feature in determining the cost of an operator (as noted in Figures 5-6).…”
Section: Resource-aware Query Planningmentioning
confidence: 99%
“…This stems from the observation that while some prior works have considered learning models for predicting query execution times for a given physical plan in traditional databases [2,5,19,32], none of them have integrated learned models within a query optimizer for selecting physical plans. Moreover, in big data systems, resources (in particular the number of machines) play a significant role in cost estimation [46], making the integration even more challenging. Thus, we investigate the effects of learned cost models on query plans by extending the SCOPE query optimizer in a minimally invasive way for predicting costs in a resource-aware manner.…”
mentioning
confidence: 99%
See 1 more Smart Citation
“…Unfortunately, these reactive approaches take several minutes to react [21] and many of the optimization opportunities may already be missed. Additionally, reactively adjusting resources during the course of a query execution could even lead to expensive changes in the query plan [38]. Therefore, apart from the reactive approaches, we also need predictive resource allocation to provide a good starting point in the first place.…”
Section: Introductionmentioning
confidence: 99%
“…In this paper, we study predictive price-perf optimization in serverless query processing setting, i.e., resources are allocated and users are charged at the query level. We build on top of our prior work on the relationship between query performance and resources in Hive and Spark [38], predictive degree of parallelism in SQL Server [26], peak [36], adaptive [23] and optimal [32] allocation in SCOPE [25] jobs, and present an end-to-end framework for predictive price-perf optimization at the query level. We recently demonstrated this system design and concept [37], and in this paper we present more generalized models, detailed architecture, analysis, and results, apart from Spark optimizer extensions that combine both the predictive and the reactive approaches.…”
Section: Introductionmentioning
confidence: 99%