2018
DOI: 10.1007/s41019-018-0060-x
|View full text |Cite
|
Sign up to set email alerts
|

Approximate Calculation of Window Aggregate Functions via Global Random Sample

Abstract: Window functions have been a part of the SQL standard since 2003 and have been studied extensively during the past decade. They are widely used in data analysis; almost all the current mainstream commercial databases support window functions. However, in recent years the size of datasets is growing steeply; the existing window function implementations are not efficient enough. Recently, some sampling-based algorithms (e.g., online aggregation) are proposed to deal with large and complex data in relational data… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
2
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 12 publications
(5 citation statements)
references
References 29 publications
0
5
0
Order By: Relevance
“…Simultaneously, there was a surge in research within the related field of Approximate Query Processing (AQP). One notable alternative to cardinality estimation and AQP problems involved sampling-based methods [16][17][18]. Unlike synopsis-based approaches like histograms, these methods required maintaining a fixed-size or fixed-proportion sample of the entire dataset, relying on this sample to provide estimated query results.…”
Section: Related Workmentioning
confidence: 99%
“…Simultaneously, there was a surge in research within the related field of Approximate Query Processing (AQP). One notable alternative to cardinality estimation and AQP problems involved sampling-based methods [16][17][18]. Unlike synopsis-based approaches like histograms, these methods required maintaining a fixed-size or fixed-proportion sample of the entire dataset, relying on this sample to provide estimated query results.…”
Section: Related Workmentioning
confidence: 99%
“…Online Query Sampling techniques are widely used to support approximate query processing [84,101]. Given a time constraint in an AQP query, a sample size can be computed by estimating how many samples the system can process within the time constraint.…”
Section: Online Aggregationmentioning
confidence: 99%
“…Then, the sampling techniques can be used to get a set of samples and the selected samples are used to answer the future queries. As the distribution of many real-world datasets are proved to be uniform-like distribution and Gaussian-like distribution, early online sampling methods use random sampling to select samples [83,84,88,101]. Note that given the data-distribution assumption and random sampling can provide users with a confidence interval in most cases.…”
Section: Online Aggregationmentioning
confidence: 99%
“…Step 1 and 2 prepare the candidate splits for step 3 to convert feature values into bin indexes. Quantile sketch is a widely-used data structure for approximate query [25,34] and is usually small in size [15,22,14], so the network overhead is almost negligible. The communication bottleneck incurs in step 4.…”
Section: Horizontal-to-vertical Transformationmentioning
confidence: 99%