Proceedings of the International Workshop on Real-Time Business Intelligence and Analytics 2018
DOI: 10.1145/3242153.3242157
|View full text |Cite
|
Sign up to set email alerts
|

Real-time ETL in Striim

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
1
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 12 publications
(4 citation statements)
references
References 10 publications
0
1
0
Order By: Relevance
“…The authors present a view selection methodology with three components: the main memory and cache management; the dynamic selection of MV (materialized views); and a stream ETL process: starts by extracting the instances from the RDF data sources and saving them temporarily in the buffer of inputs. In [31], the authors present a new distributed ETL architecture, Striim developed to collect datasets from different sources and transform them to aggregate without considering the same or different timestamps. The ETL engine runs in a scalable and fault-tolerant cluster of compute nodes.…”
Section: Related Workmentioning
confidence: 99%
“…The authors present a view selection methodology with three components: the main memory and cache management; the dynamic selection of MV (materialized views); and a stream ETL process: starts by extracting the instances from the RDF data sources and saving them temporarily in the buffer of inputs. In [31], the authors present a new distributed ETL architecture, Striim developed to collect datasets from different sources and transform them to aggregate without considering the same or different timestamps. The ETL engine runs in a scalable and fault-tolerant cluster of compute nodes.…”
Section: Related Workmentioning
confidence: 99%
“…Boury-Brisset [18], propose the design and implementation of a framework based on scalable multiintelligence data integration services to facilitate the integration of heterogeneous unstructured and structured data. Pareek et al [19] presented a new distributed architecture of ETL Striim developed to collect datasets from different sources. The ETL engine runs on a scalable and fault-tolerant cluster of compute nodes.…”
Section: Related Workmentioning
confidence: 99%
“…Lastly, data output involves loading transformed data immediately into target systems for further analysis, storage, or visualization. Output systems may include databases, data warehouses, or other analytics platforms, depending on the specific requirements of the application [19]. In this section, we will mention some works related to our article that served as either direct or indirect inspiration and solved similar problems or utilized the same technologies as our team.…”
Section: Real-time Data Processing Toolsmentioning
confidence: 99%