Proceedings 2004 VLDB Conference 2004
DOI: 10.1016/b978-012088469-8/50031-0
|View full text |Cite
|
Sign up to set email alerts
|

Memory-Limited Execution of Windowed Stream Joins

Abstract: We address the problem of computing approximate answers to continuous sliding-window joins over data streams when the available memory may be insufficient to keep the entire join state. One approximation scenario is to provide a maximum subset of the result, with the objective of losing as few result tuples as possible. An alternative scenario is to provide a random sample of the join result, e.g., if the output of the join is being aggregated. We show formally that neither approximation can be addressed effec… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
117
0

Year Published

2004
2004
2017
2017

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 65 publications
(117 citation statements)
references
References 1 publication
0
117
0
Order By: Relevance
“…Perhaps the most powerful formal approach is STREAM's CQL query language [25], which extends SQL with support for window queries. Like SQL itself, CQL is declarative and admits of a formal specification [6]; and there are some initial results characterizing a sub-class of queries that can be computed with bounded memory [28,5]. However, as we pointed out in the introduction, it is not clear whether SQL based languages with set semantics are suitable for real-time event detection and composition.…”
Section: Related Workmentioning
confidence: 99%
“…Perhaps the most powerful formal approach is STREAM's CQL query language [25], which extends SQL with support for window queries. Like SQL itself, CQL is declarative and admits of a formal specification [6]; and there are some initial results characterizing a sub-class of queries that can be computed with bounded memory [28,5]. However, as we pointed out in the introduction, it is not clear whether SQL based languages with set semantics are suitable for real-time event detection and composition.…”
Section: Related Workmentioning
confidence: 99%
“…Load shedding has been discussed for approximate stream join processing [29,30,31], but in these approaches all inputs of the join operator are streams. For semi-stream joins, load shedding has been first discussed in [11].…”
Section: Load Sheddingmentioning
confidence: 99%
“…This model answers the question "which possible candidate m-tuple in the input window will be processed at a given point in time?" The set of objects from which Φ picks its inputs is generally governed by a timeconstraint on the input objects: usually implemented in a time-windowed manner [12]. In this paper, we do not assume any specific windowing scheme.…”
Section: Fusion Operator Modelmentioning
confidence: 99%
“…In such a case, the node will need to shed (or discard) some of the input data. Most existing work on semantic load shedding [6,12,14] (see Section 5 for the related work) on the other hand tackle the overflow problem in stream databases by aiming to maximize the throughput or at last provide a random sample of the results. In many in-network data processing applications, such as ARIA [11], however, the bottleneck is the processing power of the operators and, in many cases, the throughput is constant.…”
Section: Introductionmentioning
confidence: 99%