2016
DOI: 10.1007/s11227-016-1760-5
|View full text |Cite
|
Sign up to set email alerts
|

Characterizing and benchmarking stand-alone Hadoop MapReduce on modern HPC clusters

Abstract: With the emergence of high-performance data analytics, the Hadoop platform is being increasingly used to process data stored on high-performance computing clusters. While there is immense scope for improving the performance of Hadoop MapReduce (including the network-intensive shuffle phase) over these modern clusters, that are equipped with high-speed interconnects such as InfiniBand and 10/40 GigE, and storage systems such as SSDs and Lustre, it is essential to study the MapReduce component in an isolated man… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2019
2019
2019
2019

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 21 publications
(32 reference statements)
0
1
0
Order By: Relevance
“…MapReduce draws on the ideas in the Lisp functional language; Lisp defines the overall operation of the list elements and provides similar map and reduce operations. The map and reduce functions provide an abstract parallel programming interface for the system, which can quickly parallelize the data [3]. MapReduce abstracts computational processing into two operations map and reduce.…”
Section: Introductionmentioning
confidence: 99%
“…MapReduce draws on the ideas in the Lisp functional language; Lisp defines the overall operation of the list elements and provides similar map and reduce operations. The map and reduce functions provide an abstract parallel programming interface for the system, which can quickly parallelize the data [3]. MapReduce abstracts computational processing into two operations map and reduce.…”
Section: Introductionmentioning
confidence: 99%