2021
DOI: 10.1109/tbdata.2016.2624274
|View full text |Cite
|
Sign up to set email alerts
|

ScaleJoin: A Deterministic, Disjoint-Parallel and Skew-Resilient Stream Join

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 20 publications
(16 citation statements)
references
References 31 publications
0
16
0
Order By: Relevance
“…Even assuming each input stream delivers its tuples in timestamp-order, it is still challenging to preserve determinism for operators that receive tuples from various streams, as they can be interleaved due to communication. A recently proposed mechanism to support deterministic processing is ScaleGate [8], [9], a parallel coordinator that efficiently supports merging of several timestamp-sorted streams, using fine-grained synchronization. It allows for an arbitrary number of reader entities to consume tuples from the merged stream while encapsulating the communication between sources and readers through defining two methods; (i) addTuple(tuple,sourceID), which allows source entity sourceID to add tuple to the Scale-Gate, and (ii) getNextReadyTuple(readerID), which provides the next tuple in the output stream that has not been consumed yet by the reader entity readerID.…”
Section: Data Stream Processing Backgroundmentioning
confidence: 99%
“…Even assuming each input stream delivers its tuples in timestamp-order, it is still challenging to preserve determinism for operators that receive tuples from various streams, as they can be interleaved due to communication. A recently proposed mechanism to support deterministic processing is ScaleGate [8], [9], a parallel coordinator that efficiently supports merging of several timestamp-sorted streams, using fine-grained synchronization. It allows for an arbitrary number of reader entities to consume tuples from the merged stream while encapsulating the communication between sources and readers through defining two methods; (i) addTuple(tuple,sourceID), which allows source entity sourceID to add tuple to the Scale-Gate, and (ii) getNextReadyTuple(readerID), which provides the next tuple in the output stream that has not been consumed yet by the reader entity readerID.…”
Section: Data Stream Processing Backgroundmentioning
confidence: 99%
“…We can observe that if an entire history H is SLT -compatible, then SSG behaves deterministically, as it returns only ready tuples as ScaleGate [60]. However, describing what happens in a history that consists of both SLT -compatible and SLT -incompatible sub-histories is not straightforward.…”
Section: Discussionmentioning
confidence: 99%
“…It is very important to guarantee the deterministic processing of input tuples for applications like click stream analysis and traffic monitoring due to the fact that non-determinism can cause money loss or missed events. Deterministic stream processing is achieved by merging the timestamp-sorted tuples coming from different streams and feeding the operator with a timestamp-sorted stream of ready [60] tuples. Definition 1.…”
Section: Determinism and Ready Tuplesmentioning
confidence: 99%
See 2 more Smart Citations