2013
DOI: 10.1145/2522968.2522979
|View full text |Cite
|
Sign up to set email alerts
|

The family of mapreduce and large-scale data processing systems

Abstract: In the last two decades, the continuous increase of computational power has produced an overwhelming flow of data which has called for a paradigm shift in the computing architecture and large scale data processing mechanisms. MapReduce is a simple and powerful programming model that enables easy development of scalable parallel applications to process vast amounts of data on large clusters of commodity machines. It isolates the application from the details of running a distributed program such as issues on dat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
44
0

Year Published

2014
2014
2022
2022

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 142 publications
(44 citation statements)
references
References 114 publications
0
44
0
Order By: Relevance
“…However, such frameworks are not efficient for directly implementing iterative graph algorithms which often require multiple stages of complex joins [37]. In addition, the general-purpose join and aggregation mechanisms defined in such distributed frameworks are not designed to leverage the common patterns and structure in iterative graph algorithms.…”
Section: Hadoop-based Systemsmentioning
confidence: 99%
See 2 more Smart Citations
“…However, such frameworks are not efficient for directly implementing iterative graph algorithms which often require multiple stages of complex joins [37]. In addition, the general-purpose join and aggregation mechanisms defined in such distributed frameworks are not designed to leverage the common patterns and structure in iterative graph algorithms.…”
Section: Hadoop-based Systemsmentioning
confidence: 99%
“…The popular MapReduce framework [10] and its open source realization, Hadoop, 2 together with its associated ecosystem (e.g., Pig, 3 Hive 4 ) represent the pervasive technology for big data processing [37]. In principle, the MapReduce framework provides a simple but powerful programming model that enables developers to easily build scalable parallel algorithms to process massive amounts of data on clusters of commodity machines.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Zhao et al (2014) provided a comprehensive survey for a family of approaches and mechanisms of large scale data processing mechanisms that have been implemented based on the original idea of the MapReduce framework. Sakr, Liu, and Fayoumi (2013) surveyed the MapReduce framework's variants and its extensions for large scale data processing.…”
Section: Big Data and Big Data Analyticsmentioning
confidence: 99%
“…To make our presentation self-contained, we briefly review here these concepts and the associated notations. The interested reader may find in [23] a survey of NoSQL systems, while [32,73] cover MapReduce variations and other massively parallel data management frameworks.…”
Section: Building Blocks: Key-value Stores and Mapreduce Systemsmentioning
confidence: 99%