2021
DOI: 10.14778/3476249.3476285
|View full text |Cite
|
Sign up to set email alerts
|

Accelerating approximate aggregation queries with expensive predicates

Abstract: Researchers and industry analysts are increasingly interested in computing aggregation queries over large, unstructured datasets with selective predicates that are computed using expensive deep neural networks (DNNs). As these DNNs are expensive and because many applications can tolerate approximate answers, analysts are interested in accelerating these queries via approximations. Unfortunately, standard approximate query processing techniques to accelerate such queries are not applicable because they assume t… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
12
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 9 publications
(12 citation statements)
references
References 55 publications
0
12
0
Order By: Relevance
“…One method for showing that the bootstrap is valid is to demonstrate its asymptotic validity. The asymptotic validity of sampling with stochastic draws follows from the analysis in [28]. We can also use a standard subgaussian tail bound, but they give similar results.…”
Section: Inquest Algorithmmentioning
confidence: 84%
See 4 more Smart Citations
“…One method for showing that the bootstrap is valid is to demonstrate its asymptotic validity. The asymptotic validity of sampling with stochastic draws follows from the analysis in [28]. We can also use a standard subgaussian tail bound, but they give similar results.…”
Section: Inquest Algorithmmentioning
confidence: 84%
“…We model our syntax after the Apache Flink SQL language with some minor extensions [31]. Similar to unstructured AQP systems [24,25,27], a user provides InQuest with a sampling budget, a proxy model, ⊲ duration for non-continuous queries USING proxy Figure 2: Syntax for InQuest which is based on Apache Flink SQL syntax. Users provide a statistic to compute, a dataset, a segment length defined by a tumbling window, an oracle limit per segment, and a proxy model for computing proxy scores in real-time.…”
Section: Query Syntax and Semanticsmentioning
confidence: 99%
See 3 more Smart Citations