2022
DOI: 10.1609/aaai.v36i8.20800
|View full text |Cite
|
Sign up to set email alerts
|

Conditional Generative Model Based Predicate-Aware Query Approximation

Abstract: The goal of Approximate Query Processing (AQP) is to provide very fast but "accurate enough" results for costly aggregate queries thereby improving user experience in interactive exploration of large datasets. Recently proposed Machine-Learning-based AQP techniques can provide very low latency as query execution only involves model inference as compared to traditional query processing on database clusters. However, with increase in the number of filtering predicates (WHERE clauses), the approximation error sig… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 11 publications
(1 citation statement)
references
References 14 publications
0
1
0
Order By: Relevance
“…To improve latency of interactions, downsampling the data has been proposed in various works target ting approximate query processing (AQP) (Chaudhuri, Ding, and Kan-Figure 1: Divergence of intent distribution: Single sampling strategy (stratified or uniform) for all the contexts in sequential data exploration can make user to get mislead into wrong analysis flow due to approximation errors. dula 2017; Park et al 2018;Sheoran et al 2022a) and visualizations (Park, Cafarella, and Mozafari 2016;Moritz et al 2017;Porwal et al 2022). For large datasets that are often a target for EDA (Galakatos et al 2017;Wang et al 2014), running queries on samples can substantially reduce the query latency.…”
Section: Introductionmentioning
confidence: 99%
“…To improve latency of interactions, downsampling the data has been proposed in various works target ting approximate query processing (AQP) (Chaudhuri, Ding, and Kan-Figure 1: Divergence of intent distribution: Single sampling strategy (stratified or uniform) for all the contexts in sequential data exploration can make user to get mislead into wrong analysis flow due to approximation errors. dula 2017; Park et al 2018;Sheoran et al 2022a) and visualizations (Park, Cafarella, and Mozafari 2016;Moritz et al 2017;Porwal et al 2022). For large datasets that are often a target for EDA (Galakatos et al 2017;Wang et al 2014), running queries on samples can substantially reduce the query latency.…”
Section: Introductionmentioning
confidence: 99%