Social Big Data Analytics 2021
DOI: 10.1007/978-981-33-6652-7_2

Introduction to Big Data Technology

Abstract: Big data is no longer "all just hype" but is widely applied in nearly all aspects of business, government, and organizations, together with the AI technology stack. Its influence goes far beyond a simple technical innovation and reaches all areas of the world. This chapter first gives a historical review of big data, followed by a discussion of the characteristics of big data, i.e. from the 3 V's up to the 10 V's of big data. The chapter then introduces technology stacks for an organization to build a big data application, fr…

Citations: cited by 4 publications (6 citation statements)
References: 47 publications
“…Furthermore, Abu-Salih et al (2021) claims that expensive, highly reliable hardware isn't required; instead, readily available hardware on the market is a viable option when constructing Hadoop Ecosystems. There is no Single Point of Failure (SPOF) issue in HDFS, because one of HDFS's design goals is to handle SPOF, so when a workstation fails in a Hadoop cluster, there will be no visible obstruction to users (Abu-Salih et al, 2021). One issue to be aware of is that HDFS may not be the best choice when an application demands low-latency data access (Abu-Salih et al, 2021).…”
Section: Hadoop Distributed File System (HDFS) | Type: mentioning
confidence: 99%
“…There is no Single Point of Failure (SPOF) issue in HDFS, because one of HDFS's design goals is to handle SPOF, so when a workstation fails in a Hadoop cluster, there will be no visible obstruction to users (Abu-Salih et al, 2021). One issue to be aware of is that HDFS may not be the best choice when an application demands low-latency data access (Abu-Salih et al, 2021). According to Abu-Salih et al (2021), if the data set comprises a large number of small files, it is not a good idea to utilize HDFS to store the data since HDFS stores files to a block that is often set to 128 MB (by default) or 256 MB, resulting in storage waste.…”
Section: Hadoop Distributed File System (HDFS) | Type: mentioning
confidence: 99%
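
The small-files concern in the statement above can be made concrete with a little arithmetic. The following is a minimal sketch, not HDFS code: it only counts how many blocks a set of files is split into, assuming the 128 MB default block size quoted above and at least one block per non-empty file. The function names `blocks_needed` and `total_blocks` are hypothetical, and the sketch does not model how much disk space or metadata each block actually costs.

```python
# Illustrative arithmetic only; nothing here calls HDFS.
BLOCK_SIZE = 128 * 1024 * 1024  # 128 MB, the default quoted in the statement above
ONE_MB = 1024 * 1024


def blocks_needed(file_size: int, block_size: int = BLOCK_SIZE) -> int:
    """Blocks one file is split into (hypothetical helper, integer ceiling)."""
    if file_size <= 0:
        return 0
    return (file_size + block_size - 1) // block_size


def total_blocks(file_sizes, block_size: int = BLOCK_SIZE) -> int:
    """Total block count for a collection of file sizes given in bytes."""
    return sum(blocks_needed(size, block_size) for size in file_sizes)


# The same volume of data stored two ways (sizes are made-up examples).
many_small_files = [ONE_MB] * 10240   # 10,240 files of 1 MB each
one_large_file = [10240 * ONE_MB]     # a single file of 10,240 MB

print(total_blocks(many_small_files))  # 10240
print(total_blocks(one_large_file))    # 80
```

Under these assumptions the same volume of data needs 10,240 blocks as small files but only 80 as one large file, which is the kind of overhead the citing authors caution against.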