2014
DOI: 10.1002/cpe.3333
|View full text |Cite
|
Sign up to set email alerts
|

Modeling and optimizing MapReduce programs

Abstract: MapReduce frameworks allow programmers to write distributed, dataparallel programs that operate on multisets. These frameworks offer considerable flexibility to support various kinds of programs and data. To understand the essence of the programming model better and to provide a rigorous foundation for optimizations, we present an abstract, functional model of MapReduce along with a number of customization options. We demonstrate that the MapReduce programming model can also represent programs that operate on … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
14
0

Year Published

2015
2015
2019
2019

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 14 publications
(14 citation statements)
references
References 37 publications
0
14
0
Order By: Relevance
“…With cost models, we can find performance bugs and optimize programs before execution or after little profiling. Such a cost model was also given by Dörre et al [10].…”
Section: One-pass Mapreduce Implementation For Scanbspmentioning
confidence: 95%
See 3 more Smart Citations
“…With cost models, we can find performance bugs and optimize programs before execution or after little profiling. Such a cost model was also given by Dörre et al [10].…”
Section: One-pass Mapreduce Implementation For Scanbspmentioning
confidence: 95%
“…Among several studies, two studies by Lämmel [15] and Dörre et al [10] gave detailed functional models of MapReduce computation.…”
Section: Differences From Previous Workmentioning
confidence: 99%
See 2 more Smart Citations
“…In the last decade, new technologies based on the cloud model, offered to many organizations the possibility to store and analyze their data in an efficient way and a timely manner, which help them uncover patterns, get insight and provide better services. Hadoop, is an open source framework that offers a distributed storage layer, HDFS [1], tightly coupled with a distributed processing engine,MapReduce [2]. Hadoop allowsthe partitioning of data and computation across clusters of thousands of machines, in such a way, that each machine compute its local or neighbor's data.…”
Section: Introductionmentioning
confidence: 99%