2007 IEEE 23rd International Conference on Data Engineering Workshop 2007
DOI: 10.1109/icdew.2007.4401054
|View full text |Cite
|
Sign up to set email alerts
|

Load Shedding for Window Joins on Multiple Data Streams

Abstract: We consider the problem of semantic load shedding for continuous queries containing window joins on multiple data streams and propose a robust approach that is effective with the different semantic accuracy criteria that are required in different applications. In fact, our approach can be used to (i) maximize the number of output tuples produced by joins, and (ii) optimize the accuracy of complex aggregates estimates under uniform random sampling. We first consider the problem of computing maximal subsets of a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
13
0

Year Published

2010
2010
2020
2020

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(13 citation statements)
references
References 22 publications
0
13
0
Order By: Relevance
“…Thus, in the context of frequent pattern mining, pattern verification [18], and any other application that consists of only counting queries, we make the following observation: from (14) and (15) we notice that both the uniform and proportional 16 load shedding policies produce the same total variance (G uni = G prop ). However, the uniform approach is still more favorable since it does not require knowing the f k values, while the proportional method does.…”
Section: B Minimizing the Relative Errormentioning
confidence: 99%
See 1 more Smart Citation
“…Thus, in the context of frequent pattern mining, pattern verification [18], and any other application that consists of only counting queries, we make the following observation: from (14) and (15) we notice that both the uniform and proportional 16 load shedding policies produce the same total variance (G uni = G prop ). However, the uniform approach is still more favorable since it does not require knowing the f k values, while the proportional method does.…”
Section: B Minimizing the Relative Errormentioning
confidence: 99%
“…The prior work has addressed the processing of join queries under load shedding [10], [11], [16], which usually involves adhoc heuristics. For aggregate queries, which is the focus of this paper, we instead use random load shedding.…”
Section: Related Workmentioning
confidence: 99%
“…They considered the MAX-subset measure, which maximizes the number of tuples in the approximate output of the join. The MAX-subset measure was considered for load shedding in many algorithms [40][41][42]. Das et al [26] also proposed two heuristics to determine the priority of tuples in an online join: PROB and LIFE.…”
Section: Load Sheddingmentioning
confidence: 99%
“…Their method focused on the memory-limited situation. Many other studies including [26,[40][41][42] also assumed the memory-limited situation, and then discussed their load shedding algorithms. On the other hand, Gedik et al [39,44] emphasized a situation where the CPU becomes a bottleneck (i.e., when an input arrival rate exceeds CPU processing speed), and then proposed load shedding techniques to shed the CPU load.…”
Section: Load Sheddingmentioning
confidence: 99%
See 1 more Smart Citation