Breaking the MapReduce stage barrier

Verma, Abhishek; Zea, Nicolas; Cho, Brian; Gupta, Indranil; Campbell, Roy H.

doi:10.1007/s10586-011-0182-7

Cited by 35 publications

(13 citation statements)

References 5 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For example, two parents that are selected from the pool are (1,3,2,5,4,6) and (2,2,3,6,5,4). If crossover index is three, offsprings become (1, 3, 2, 6, 5, 4) and (2,2,3,5,4,6). The data dependency is checked by using the reordering rules presented in Section III-B.…”

Section: Generating New Membersmentioning

confidence: 99%

“…Building more powerful compute units has been suggested in [2]- [4] for increasing scalability and reducing node inef-ficiency problems in future large scale processing systems. A recent publication [5] suggests more aggressive use of instruction extensible processors at compute nodes of large scale processing systems in order to tailor application for improved performance and efficiency.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

MIPT: Rapid exploration and evaluation for migrating sequential algorithms to multiprocessing systems with multi-port memories

Malazgirt

Yurdakul

Niar

2014

2014 International Conference on High Performance Computing &Amp; Simulation (HPCS)

View full text Add to dashboard Cite

Research has shown that the memory load/store instructions consume an important part in execution time and energy consumption. Extracting available parallelism at different granularity has been an important approach for designing next generation highly parallel systems. In this work, we present MIPT, an architecture exploration framework that leverages instruction parallelism of memory and ALU operations from a sequential algorithm's execution trace. MIPT heuristics recommend memory port sizes and issue slot sizes for memory and ALU operations. Its custom simulator simulates and evaluates the recommended parallel version of the execution trace for measuring performance improvements versus dual port memory. MIPT's architecture exploration criteria is to improve performance by utilizing systems with multi-port memories and multi-issue ALUs. There exists design exploration tools such as Multi2Sim and Trimaran. These simulators offer customization of multi-port memory architectures but designers' initial starting points are usually unclear. Thus, MIPT can suggest initial starting point for customization in those design exploration systems. In addition, given same application with two different implementations, it is possible to compare their execution time by the MIPT simulator.

show abstract

Section: Generating New Membersmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

MIPT: Rapid exploration and evaluation for migrating sequential algorithms to multiprocessing systems with multi-port memories

Malazgirt

Yurdakul

Niar

2014

2014 International Conference on High Performance Computing &Amp; Simulation (HPCS)

View full text Add to dashboard Cite

show abstract

“…One of the most important advantages of MapReduce is its convenience, such that, programmers can process massive data without knowing the details of distributed implementation, and users can process large scale of data by only providing the Map and Reduce interface. The Map and Reduce stage is strict in the original MapReduce model, but there are some works try to break the barrier [12], [13]. The initial MapReduce model was designed for off-line data processing.…”

Section: Introductionmentioning

confidence: 99%

A MapReduce scheduling algorithm for time constraints in heterogeneous environment

Deng

2014

2014 10th International Conference on Natural Computation (ICNC)

View full text Add to dashboard Cite

In public Infrastructure-as-a-Service (IaaS), virtual machines, servers, storage, and network are provided by cloud service providers. As a cloud service provider, who is facing a task for time constraint, how to schedule the service resources to achieve the lowest cost becomes more and more important. Recently, most of works about MapReduce task scheduling are focus on homogeneous MapReduce framework. In this paper, we present the ILP formulation for solving the MapReduce task scheduling for time constrains problem in heterogeneous environment. This method considers processing speed, energy cost and time constrains at the same time. By using the method, we can finish the task in time and achieving lowest energy cost. Then, we solve this problem efficiently by using genetic algorithm(GA). According to our experimental results, the ILP formulation we proposed can always achieve the best solution, it also reduced the energy consumption by 10.15% compared to genetic algorithm.

show abstract

“…The testing document t1 is most similar to the training document d3. performance [152]. The validity of barrier-less MapReduce model can be found in [152].…”

Section: Similarity Of Vsm Vectorsmentioning

confidence: 98%

“…performance [152]. The validity of barrier-less MapReduce model can be found in [152]. The proof of the equality of original and pipelined MapReduce models can be found in [35].…”

Section: Similarity Of Vsm Vectorsmentioning

confidence: 99%

Analysis and acceleration of data mining algorithms on high performance reconfigurable computing platforms

Sun¹

View full text Add to dashboard Cite

Breaking the MapReduce stage barrier

Cited by 35 publications

References 5 publications

MIPT: Rapid exploration and evaluation for migrating sequential algorithms to multiprocessing systems with multi-port memories

MIPT: Rapid exploration and evaluation for migrating sequential algorithms to multiprocessing systems with multi-port memories

A MapReduce scheduling algorithm for time constraints in heterogeneous environment

Analysis and acceleration of data mining algorithms on high performance reconfigurable computing platforms

Contact Info

Product

Resources

About