2018
DOI: 10.1002/cpe.4695
|View full text |Cite
|
Sign up to set email alerts
|

Skew‐aware online aggregation over joins through guided sampling

Abstract: Online aggregation is a query processing technique that returns approximate answers with error guarantees (in the form of confidence intervals) continuously during the query execution process.This approach offers users a suitable tradeoff between query efficiency and accuracy. The key issue of online aggregation is how to ensure a random sample collection's efficiency and effectiveness. However, the often-used "blind" sampling method does not adequately consider dataset statistics and other useful information,… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
1
0

Year Published

2018
2018
2018
2018

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 30 publications
0
1
0
Order By: Relevance
“…This becomes a glaring performance issue for skewed data distribution over joins. To alleviate this problem, Wang et al utilize data set statistics to propose a new “guided” sampling approach, which consists of a logic‐partition‐based weighted Gaussian sampling method tailored for the skewed join key, as well as a two‐level sample allocation method that applies to the skewed measured value …”
mentioning
confidence: 99%
“…This becomes a glaring performance issue for skewed data distribution over joins. To alleviate this problem, Wang et al utilize data set statistics to propose a new “guided” sampling approach, which consists of a logic‐partition‐based weighted Gaussian sampling method tailored for the skewed join key, as well as a two‐level sample allocation method that applies to the skewed measured value …”
mentioning
confidence: 99%