Improved Cardinality Estimation by Learning Queries Containment Rates

Hayek, Rojeh; Shmueli, Oded

doi:10.48550/arxiv.1908.07723

Cited by 2 publications

(2 citation statements)

References 20 publications

“…Machine Learning has been recently considered a promising technique for database query optimization, e.g. Leo [24], which adjusted histogram estimators by monitoring similar queries, [10][15][25] [28] proposed to use deep learning to learn cardinality estimations or query costs, in [1][2][3] cardinality estimation is based on query driven approaches, [29] [30] described unsupervised Monte Carlo-based solutions, in [8] cardinalities were estimated via query containment rates, [12][17] demonstrated that reinforcement learning helps find good query execution plans. In [10] the authors proposed to use a multi-set convolutional network to predict join-crossing correlations in the data.…”

Section: Related Work (mentioning)

confidence: 99%

Learning From User-Specified Optimizer Hints in Database Systems

Zakrzewicz

2024

Foundations of Computing and Decision Sciences

Recently, numerous machine learning (ML) techniques have been applied to address database performance management problems, including cardinality estimation, cost modeling, optimal join order prediction, hint generation, etc. In this paper, we focus on query optimizer hints employed by users in their queries in order to mask some Query Optimizer deficiencies. We treat the query optimizer hints, bound to previous queries, as significant additional query metadata and learn to automatically predict which new queries will pose similar performance challenges and should therefore also be supported by query optimizer hints. To validate our approach, we have performed a number of experiments using real-life SQL workloads and we achieved promising results.

“…Unsupervised approaches, based on Monte Carlo integration, have also been proposed [116,117]. In [43], the authors present a scheme called CRN for estimating cardinalities via query containment rates.…”

Section: Learned Query Optimization (mentioning)

confidence: 99%

Towards instance-optimized data systems

Kraska

2021

Proc. VLDB Endow.

In recent years, we have seen increased interest in applying machine learning to system problems. For example, there has been work on applying machine learning to improve query optimization, indexing, storage layouts, scheduling, log-structured merge trees, sorting, compression, and sketches, among many other data management tasks. Arguably, the ideas behind these techniques are similar: machine learning is used to model the data and/or workload in order to derive a more efficient algorithm or data structure. Ultimately, these techniques will allow us to build "instance-optimized" systems: that is, systems that self-adjust to a given workload and data distribution to provide unprecedented performance without the need for tuning by an administrator. While many of these techniques promise orders-of-magnitude better performance in lab settings, there is still general skepticism about how practical the current techniques really are. The following is intended as a progress report on ML for Systems and its readiness for real-world deployments, with a focus on our projects done as part of the Data Systems and AI Lab (DSAIL) at MIT. By no means is it a comprehensive overview of all existing work, which has been steadily growing over the past several years not only in the database community but also in the systems, networking, theory, PL, and many other adjacent communities.
