2014
DOI: 10.1145/2608199
|View full text |Cite
|
Sign up to set email alerts
|

Hadoop Extensions for Distributed Computing on Reconfigurable Active SSD Clusters

Abstract: In this article, we propose new extensions to Hadoop to enable clusters of reconfigurable active solid-state drives (RASSDs) to process streaming data from SSDs using FPGAs. We also develop an analytical model to estimate the performance of RASSD clusters running under Hadoop. Using the Hadoop RASSD platform and network simulators, we validate our design and demonstrate its impact on performance for different workloads taken from Stanford's Phoenix MapReduce project. Our results show that for a hardware accele… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2014
2014
2019
2019

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 12 publications
(4 citation statements)
references
References 33 publications
0
4
0
Order By: Relevance
“…Hadoop (also known as Apache Hadoop) is an open-source framework for distributed storage and distributed processing on huge data sets into computer clusters [812]. For Hadoop, FPGAs' applications include energy-efficient acceleration of big data analytics [809], FPGA-accelerated Hadoop cluster for deep learning computations [813], process streaming data from SSDs (Solid State Drives) using FPGAs [814] and low-power Hadoop cluster [815].…”
Section: Big Datamentioning
confidence: 99%
“…Hadoop (also known as Apache Hadoop) is an open-source framework for distributed storage and distributed processing on huge data sets into computer clusters [812]. For Hadoop, FPGAs' applications include energy-efficient acceleration of big data analytics [809], FPGA-accelerated Hadoop cluster for deep learning computations [813], process streaming data from SSDs (Solid State Drives) using FPGAs [814] and low-power Hadoop cluster [815].…”
Section: Big Datamentioning
confidence: 99%
“…However, the advantages of SSD, such as stability, high bandwidth, fast response time, and low power consumption can be effectively used in Hadoop system [2]. As such, in order to secure price competitiveness taking advantage of performance, some Hadoop system has been adopting hybrid storage system that makes use of the advantages of SSD and HDD of the large capacity [3]. It is to add SSD to the storage device that consists only of HDD and let SSD play the role of cache unit.…”
Section: Introductionmentioning
confidence: 99%
“…Finally, some works propose extensions to Hadoop with SSDs. For instance, [59] proposes extensions to enable clusters of reconfigurable active SSDs to process streaming data from SSDs using FPGAs. VENU [63] is a proposal for an extension to Hadoop that will use SSDs as a cache for the slower HDDs not for all data, but only for those that are expected to benefit from the use of SSDs.…”
Section: Related Workmentioning
confidence: 99%
“…Recently with the advent of faster Solid State Drives (SSDs) research is emerging to test and possibly to exploit the potential of the new technologically advanced drive [14], [59], [60], [68].…”
Section: Introductionmentioning
confidence: 99%