Proceedings of the 22nd ACM International Conference on Information &Amp; Knowledge Management 2013
DOI: 10.1145/2505515.2505728
|View full text |Cite
|
Sign up to set email alerts
|

A new operator for efficient stream-relation join processing in data streaming engines

Abstract: In the last decade, Stream Processing Engines (SPEs) have emerged as a new processing paradigm that can process huge amounts of data while retaining low latency and high-throughputs. Yet, it is often necessary to join streaming data with traditional databases to provide more contextual information for the end-users and applications. The major problem that we confront is to join the fast arriving stream tuples with the static relation tuples that are on a slow database. This is what we call the Stream-Relation … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
9
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(9 citation statements)
references
References 11 publications
0
9
0
Order By: Relevance
“…The algorithm for joining stream data with a disk-based relation by Derakhshan et al [31] uses a cache to store frequent master data tuples and a waiting queue for stream tuples that are not joined through the cache. The algorithm processes this waiting queue in batches.…”
Section: B Index-based Semi-stream Joinmentioning
confidence: 99%
“…The algorithm for joining stream data with a disk-based relation by Derakhshan et al [31] uses a cache to store frequent master data tuples and a waiting queue for stream tuples that are not joined through the cache. The algorithm processes this waiting queue in batches.…”
Section: B Index-based Semi-stream Joinmentioning
confidence: 99%
“…Derakhshan et al [3] propose a cache-based method for the join between streaming data and a relation stored in a database under the record-at-a-time model in a centralized environment. So far, little attention has been paid to the stream-relation join processing under the micro-batch model in a distributed environment.…”
Section: Related Workmentioning
confidence: 99%
“…In order to interpret, enrich, and analyze the streaming data, streams need to be joined with data stored in relational or NoSQL databases (e.g., reference tables containing information about users or items) [2], [3]. To get meaningful information about an RFID tag ID, a Stream Processing Engine (SPE) must query the database to get the information about the ID [3]. To resolve shortened URLs in Tweets, an SPE needs to look up the expanded URLs stored in a database [4].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Stream relation join: Prior work on optimization of streamrelation joins for non-distributed streaming systems includes MeshJoin [20], Semi-Streaming Index Join (SSIJ) [2], CacheJoin [18], and a technique proposed by Derakhshan et al in [8].…”
Section: Related Workmentioning
confidence: 99%