2007 IEEE 23rd International Conference on Data Engineering 2007
DOI: 10.1109/icde.2007.367893
|View full text |Cite
|
Sign up to set email alerts
|

Supporting Streaming Updates in an Active Data Warehouse

Abstract: Abstract

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
63
0

Year Published

2013
2013
2023
2023

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 77 publications
(63 citation statements)
references
References 18 publications
0
63
0
Order By: Relevance
“…We do not consider the ordering of stream tuples in the join output as it is not important particularly in the scenario of data warehousing. To evaluate the approach, we combine the front-stage with two known semi-stream algorithms: (i) MESHJOIN [10,11], (ii) HYBRIDJOIN [12]. Our experimental results demonstrate the performance benefit of the approach.…”
mentioning
confidence: 97%
See 2 more Smart Citations
“…We do not consider the ordering of stream tuples in the join output as it is not important particularly in the scenario of data warehousing. To evaluate the approach, we combine the front-stage with two known semi-stream algorithms: (i) MESHJOIN [10,11], (ii) HYBRIDJOIN [12]. Our experimental results demonstrate the performance benefit of the approach.…”
mentioning
confidence: 97%
“…A characteristic of MESHJOIN is that it performs a staggered execution of the hash table build in order to load in stream tuples more steadily. The algorithm makes no assumptions about data distribution and the organization of the master data, and the MESHJOIN authors report that the algorithm performs worse with skewed data [10,11]. We previously presented a variant, R-MESHJOIN (reduced Mesh Join) [23], in order to clarify certain dependencies among the components of MESHJOIN.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The common case for updates to a data warehouse is the Extract-TransformLoad (ETL) processing [25], i.e., data are extracted from the sources and loaded to the data warehouse during specified time intervals. As applications are pushing for higher levels of freshness, data warehouses are updated as frequently as possible, giving rise to Active Data Warehousing [13,22,21].…”
Section: Previous Workmentioning
confidence: 99%
“…A novel stream-based equijoin algorithm, MESHJOIN (N. Polyzotis, Skiadopoulos, Vassiliadis, Simitsis, & Frantzell, 2007) (Neoklis Polyzotis, Skiadopoulos, Vassiliadis, Simitsis, & Frantzell, 2008) is in principle a hash join, where the stream serves as the build input and the diskbased relation serves as the probe input. The main contribution is a staggered execution of the hash table build and an optimization of the disk buffer for the disk-based relation.…”
Section: Figure 1: An Example Of Stream-based Joinmentioning
confidence: 99%