2011
DOI: 10.1007/s10586-011-0182-7
|View full text |Cite
|
Sign up to set email alerts
|

Breaking the MapReduce stage barrier

Abstract: The MapReduce model uses a barrier between the Map and Reduce stages. This provides simplicity in both programming and implementation. However, in many situations, this barrier hurts performance because it is overly restrictive. Thus, we develop a method to break the barrier in MapReduce in a way that improves efficiency. Careful design of our barrier-less MapReduce framework results in equivalent generality and retains ease of programming. We motivate our case with, and experimentally study our barrier-less t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
13
0

Year Published

2013
2013
2017
2017

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 35 publications
(13 citation statements)
references
References 5 publications
0
13
0
Order By: Relevance
“…For example, two parents that are selected from the pool are (1,3,2,5,4,6) and (2,2,3,6,5,4). If crossover index is three, offsprings become (1, 3, 2, 6, 5, 4) and (2,2,3,5,4,6). The data dependency is checked by using the reordering rules presented in Section III-B.…”
Section: Generating New Membersmentioning
confidence: 99%
See 1 more Smart Citation
“…For example, two parents that are selected from the pool are (1,3,2,5,4,6) and (2,2,3,6,5,4). If crossover index is three, offsprings become (1, 3, 2, 6, 5, 4) and (2,2,3,5,4,6). The data dependency is checked by using the reordering rules presented in Section III-B.…”
Section: Generating New Membersmentioning
confidence: 99%
“…Building more powerful compute units has been suggested in [2]- [4] for increasing scalability and reducing node inef-ficiency problems in future large scale processing systems. A recent publication [5] suggests more aggressive use of instruction extensible processors at compute nodes of large scale processing systems in order to tailor application for improved performance and efficiency.…”
Section: Introductionmentioning
confidence: 99%
“…One of the most important advantages of MapReduce is its convenience, such that, programmers can process massive data without knowing the details of distributed implementation, and users can process large scale of data by only providing the Map and Reduce interface. The Map and Reduce stage is strict in the original MapReduce model, but there are some works try to break the barrier [12], [13]. The initial MapReduce model was designed for off-line data processing.…”
Section: Introductionmentioning
confidence: 99%
“…The testing document t1 is most similar to the training document d3. performance [152]. The validity of barrier-less MapReduce model can be found in [152].…”
Section: Similarity Of Vsm Vectorsmentioning
confidence: 98%
“…performance [152]. The validity of barrier-less MapReduce model can be found in [152]. The proof of the equality of original and pipelined MapReduce models can be found in [35].…”
Section: Similarity Of Vsm Vectorsmentioning
confidence: 99%