Proceedings of the 13th ACM International Conference on Distributed and Event-Based Systems 2019
DOI: 10.1145/3328905.3332511
|View full text |Cite
|
Sign up to set email alerts
|

Generating Reproducible Out-of-Order Data Streams

Abstract: Evaluating modern stream processing systems in a reproducible manner requires data streams with different data distributions, data rates, and real-world characteristics such as delayed and outof-order tuples. In this paper, we present an open source stream generator which generates reproducible and deterministic out-oforder streams based on real data files, simulating arbitrary fractions of out-of-order tuples and their respective delays. CCS CONCEPTS • Information systems → Stream management.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
3
3

Relationship

1
5

Authors

Journals

citations
Cited by 10 publications
(4 citation statements)
references
References 7 publications
0
4
0
Order By: Relevance
“…Consequently, the binary tree is updated by purging the expired p-value. The query result is computed by aggregating the t-value (16) and the g-value (15), which is 16.…”
Section: T-value = Updatet Ree(expv Alue)mentioning
confidence: 99%
See 1 more Smart Citation
“…Consequently, the binary tree is updated by purging the expired p-value. The query result is computed by aggregating the t-value (16) and the g-value (15), which is 16.…”
Section: T-value = Updatet Ree(expv Alue)mentioning
confidence: 99%
“…One practical issue is how to handle an infinite sequence of streaming records [5], [6], [7], [8], [9], [15] using finite memory (or computational resources). Using a window to extract partial data from an infinite stream is one solution as a window limits the evaluation scope given an operation over a sequence of records.…”
Section: Introductionmentioning
confidence: 99%
“…The open-source stream generator proposed by Grulich et al [27] enables the evaluation of modern stream processing systems. It produces deterministic data streams from arbitrary input data sets with different data rates, distributions, and characteristics such as the fraction of out-of-order tuples and their delay.…”
Section: Applicationsmentioning
confidence: 99%
“…Condor [48], Stream Generator [27], I2 [64], M4 [32] speed networks, to accelerate the performance of stream processing systems. Sect.…”
Section: Introductionmentioning
confidence: 99%