2021
DOI: 10.14778/3503585.3503586
|View full text |Cite
|
Sign up to set email alerts
|

Cardinality estimation in DBMS

Abstract: Cardinality estimation (CardEst) plays a significant role in generating high-quality query plans for a query optimizer in DBMS. In the last decade, an increasing number of advanced CardEst methods (especially ML-based) have been proposed with outstanding estimation accuracy and inference latency. However, there exists no study that systematically evaluates the quality of these methods and answer the fundamental problem: to what extent can these methods improve the performance of query optimizer in … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 41 publications
(8 citation statements)
references
References 37 publications
0
8
0
Order By: Relevance
“…The cardinality estimation problem for database queries has recently been tackled using mainly classical machine learning algorithms [20]. While the results of these methods are more accurate than traditional non-machine learning-based approaches, they still suffer from high training and inference costs while learning multivariate correlations [15,48], and can mitigate these "side effects" only when using single-table statistics as an input [51]. This is still an unsolved problem, and therefore, it is uncertain if such models will prevail in real databases, especially in dynamic environments [48].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The cardinality estimation problem for database queries has recently been tackled using mainly classical machine learning algorithms [20]. While the results of these methods are more accurate than traditional non-machine learning-based approaches, they still suffer from high training and inference costs while learning multivariate correlations [15,48], and can mitigate these "side effects" only when using single-table statistics as an input [51]. This is still an unsolved problem, and therefore, it is uncertain if such models will prevail in real databases, especially in dynamic environments [48].…”
Section: Related Workmentioning
confidence: 99%
“…Learned cardinalities estimators (LCEs) are exactly attempting to mitigate these issues with the help of machine learning methods. Although deep learning seems to give significant improvement compared to classical methods on datasets with more complicated data distributions and join schemas, it still requires hundreds of millions of learnable parameters that are often hard to tune [15].…”
Section: Introductionmentioning
confidence: 99%
“…a query set can be cast as the multi-query-dataset cardinality estimation (MCE) problem. So, we also review existing work on single-query-dataset cardinality estimation (SCE), which can be categorized as querydriven methods and data-driven methods [21,30]. Query-driven methods [17,22,31,41,46,47,49,52,53] deploy discriminative models trained on a set of historic queries to predict the cardinality for a single query.…”
Section: Related Workmentioning
confidence: 99%
“…Data-driven methods [24,37,39,51,54,56,57,60] deploy generative models trained on data without using query workloads. In general, query-driven methods are inflexible, especially when representative queries are unavailable, and data-driven methods can achieve better performance than query-driven methods [21]. However, existing approaches to the SCE problem cannot address the MCE problem as they only estimate the cardinality of a single query over a single dataset while we estimate the cardinality for a query set over a set of datasets.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation