2020
DOI: 10.1109/access.2019.2962724

Ordinal Optimization-Based Performance Model Estimation Method for HDFS

Abstract: Modeling and analyzing the performance of distributed file systems (DFSs) benefit the reliability and quality of data processing in data-intensive applications. Hadoop Distributed File System (HDFS) is a typical representative of DFSs. Its internal heterogeneity and complexity as well as external disturbance contribute to HDFS's built-in features of nonlinearity as well as randomness in system level, which raises a great challenge in modeling these features. Particularly, the randomness results in the uncertai…

Cited by 9 publications (4 citation statements)
References 35 publications
“…This not only reduces data access latency, but also provides load balancing of data access requests. This motivated the authors of [18], [19], [20] to investigate the performance of HDFS in remote data access.…”
Section: Related Work (mentioning)
confidence: 99%
“…These assumptions frame the architecture of HDFS and allow the optimization of several aspects [7], especially data security and processing speed.…”
Section: B. Hadoop Distributed File System (mentioning)
confidence: 99%
“…Many works have tried to improve the strategy of placing chunks in DFS. Some works have proposed to change the storage method by reducing the number of small files by grouping them into larger files [5], or to optimize the replication factor and the size of chunks to improve internal exchanges inside the cluster [6], [7]. Another approach is to try to assign the data to the best node according to the known physical capacities of each node.…”
Section: Related Work (mentioning)
confidence: 99%
“…[22] OO has been widely-used in many complex optimization problems. Ma et al [23] incorporated the OO into piecewise linear algorithm to improve the performance optimization of distributed file systems. Ni et al [24] applied it to maximize core utilization in parallel computing.…”
Section: Introduction (mentioning)
confidence: 99%