Proceedings of the 2010 ACM Symposium on Applied Computing 2010
DOI: 10.1145/1774088.1774174
|View full text |Cite
|
Sign up to set email alerts
|

Semi-join computation on distributed file systems using map-reduce-merge model

Abstract: Semi-join is the most used technique to optimize the treatment of complex relational queries on distributed architectures. However, the overhead related to semi-join computation can be very high due to data skew and to the high cost of communication in distributed architectures. Internet search engines needs to process vast amounts of raw data every day. Hence, systems that manage such data should assure scalability, reliability and availability issues with reasonable query processing time. Hadoop and Google's… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2011
2011
2016
2016

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(4 citation statements)
references
References 14 publications
0
4
0
Order By: Relevance
“…Join processing in MapReduce has become a hot research topic in recent years [8,3,16,2,22,30,11]. Many studies have been carried out to evaluate join queries and analyze large datasets in a MapReduce environment.…”
Section: Joins With Mapreducementioning
confidence: 99%
“…Join processing in MapReduce has become a hot research topic in recent years [8,3,16,2,22,30,11]. Many studies have been carried out to evaluate join queries and analyze large datasets in a MapReduce environment.…”
Section: Joins With Mapreducementioning
confidence: 99%
“…Hassan et. Al in [30] [31] tried to explain the working of Map Reduce by using certain algorithms. The working of Map-Reduce is divided into three main operations.…”
Section: Map-reducementioning
confidence: 99%
“…MapReduce [8] is an efficient programming model for processing mass volume data. Hadoop is an open sourced platform for cloud computing that is designed for the MapReduce model.…”
Section: Fair Scheduler Cloud Computingmentioning
confidence: 99%