2021
DOI: 10.1016/j.procs.2021.06.079
|View full text |Cite
|
Sign up to set email alerts
|

Intelligent container orchestration techniques for batch and micro-batch processing and data transfer.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(5 citation statements)
references
References 15 publications
0
5
0
Order By: Relevance
“…Two libraries are standard in the Spark landscape: DStream API (RDD format) and Structured Streaming (DataFrame format). The Structured Streaming has been released in the Spark 2 version, providing DataFrame API usage on the streaming data [22]. The DataFrame API is an important feature; the study focuses on that direction since the data frame is more mature and feature-rich.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Two libraries are standard in the Spark landscape: DStream API (RDD format) and Structured Streaming (DataFrame format). The Structured Streaming has been released in the Spark 2 version, providing DataFrame API usage on the streaming data [22]. The DataFrame API is an important feature; the study focuses on that direction since the data frame is more mature and feature-rich.…”
Section: Discussionmentioning
confidence: 99%
“…Next to big data, the term "fast data" should also be evaluated. Three central data processing are considered on a time basis [22]. After getting a trigger, the batch processing method does it in query time.…”
Section: Optimizationmentioning
confidence: 99%
“…Ali and Wrembel [17] point out a key limitation in current ETL tools: the lack of efficient workflow development support due to missing automated optimization and finetuning capabilities. This gap often leads to the creation of bespoke ETL tools tailored to specific business needs [15,33,34]. The increasing data volume adds complexity to the ETL workflow design, elevating execution costs and risking operational delays or failures.…”
Section: Relevance To the Present Studymentioning
confidence: 99%
“…In batch processing, data are accumulated into groups or "batches" and processed collectively once a certain threshold or time limit is reached [34]. This approach, while an efficient and simplifying system design, can introduce latency due to the gap between data collection and processing.…”
Section: Micro-batch Data Processingmentioning
confidence: 99%
See 1 more Smart Citation