2019
DOI: 10.1109/access.2019.2946884
|View full text |Cite
|
Sign up to set email alerts
|

A Survey of Distributed Data Stream Processing Frameworks

Abstract: Big data processing systems are evolving to be more stream oriented where each data record is processed as it arrives by distributed and low-latency computational frameworks on a continuous basis. As the stream processing technology matures and more organizations invest in digital transformations, new applications of stream analytics will be identified and implemented across a wide spectrum of industries. One of the challenges in developing a streaming analytics infrastructure is the difficulty in selecting th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
52
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 117 publications
(52 citation statements)
references
References 49 publications
0
52
0
Order By: Relevance
“…Once the paper is found, inclusion/exclusion of that paper has been decided after reading its abstract and then other parts of paper. After having examined selected papers thoroughly we identified one more study [5], and totally added up to 74 primary studies.…”
Section: ) Selection Based On Snowballingmentioning
confidence: 99%
See 2 more Smart Citations
“…Once the paper is found, inclusion/exclusion of that paper has been decided after reading its abstract and then other parts of paper. After having examined selected papers thoroughly we identified one more study [5], and totally added up to 74 primary studies.…”
Section: ) Selection Based On Snowballingmentioning
confidence: 99%
“…Big data analysis platforms and tools have been reviewed in a study [3] along with their applications, such as: Hadoop, GridGain, MapReduce, HPCC and Apache Storm. Whereas difficulties in selecting the right stream processing framework were identified and addressed for different use cases while developing a streaming analytics infrastructure by [5]. This study presents critical review of key features of some stream processing engines including Storm, Spark Streaming, Flink, Kafka Streams, IBM Steams.…”
Section: B Assessment Of Rq2: Which Challenges Have Been Faced Durinmentioning
confidence: 99%
See 1 more Smart Citation
“…Finally, an interesting extension of our approach is to use open-source distributed stream processing systems, like Storm, Flink, or Spark Streaming [67]- [69] to carry our real-time analytics. As shown in several works in literature (e.g., [70]- [72]), the common use-case exploits a distributed messaging system, like Apache Kafka [73], that runs on the IoT devices and sends data to dedicated servers hosting the streaming analytics software.…”
Section: Computational Load and Scalabilitymentioning
confidence: 99%
“…The core of a DSPS is the Data Stream Processing Engine, DSPE, which enables the definition and execution of stream processing pipelines [4]. A review of the state of the art shows many alternatives for DSPEs [5] whereas the number of existing DSPSs is very limited [4].…”
Section: Introductionmentioning
confidence: 99%