2021
DOI: 10.9734/ajrcos/2021/v11i230260
|View full text |Cite
|
Sign up to set email alerts
|

A Comprehensive Survey for Hadoop Distributed File System

Abstract: In the last few days, data and the internet have become increasingly growing, occurring in big data. For these problems, there are many software frameworks used to increase the performance of the distributed system. This software is used for available ample data storage. One of the most beneficial software frameworks used to utilize data in distributed systems is Hadoop. This software creates machine clustering and formatting the work between them. Hadoop consists of two major components: Hadoop Distributed Fi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 15 publications
(8 citation statements)
references
References 48 publications
0
8
0
Order By: Relevance
“…Hadoop is a quasi-standard platform for distributed big data applications [88] . In the Hadoop MapReduce computing framework, an iterative algorithm is decomposed into a sequence of pairs of mapper and reducer operations, that are executed sequentially in a pipeline manner [89] . One pair of mapper and reducer operations is taken as an example.…”
Section: Hadoop Mapreducementioning
confidence: 99%
“…Hadoop is a quasi-standard platform for distributed big data applications [88] . In the Hadoop MapReduce computing framework, an iterative algorithm is decomposed into a sequence of pairs of mapper and reducer operations, that are executed sequentially in a pipeline manner [89] . One pair of mapper and reducer operations is taken as an example.…”
Section: Hadoop Mapreducementioning
confidence: 99%
“…In addition to MongoDB distributed storage tools, Hadoop distributed file system is also used to construct data storage subsystem ( Merceedi & Sabry, 2021 ). This is because with the operation of the rural tourism cloud platform, various types of data in the forum will become larger and larger as time goes on, and the efficiency of data retrieval by MongoDB NoSQL database will be lower and lower.…”
Section: Key Technologiesmentioning
confidence: 99%
“…HDFS adopts master/slave architecture. There is one Namenode node and multiple Datanode nodes in the HDFS cluster, while Namenode is used to manage Datanode, and Datanode is used to store actual data ( Merceedi & Sabry, 2021 ).…”
Section: Key Technologiesmentioning
confidence: 99%
“…Using HADOOP, the author [11] of the research accomplished similar work in her paper by analyzing the data from Twitter, which is also known as big data. The analytics tools and methods that the author of the study utilised are insufficient to manage huge data, however they are used in the paper [12]. Because of this, there is a prerequisite of making use of cloud storage for the kind of applications being discussed here.…”
Section: Literaturementioning
confidence: 99%