2018 IEEE 26th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS 2018
DOI: 10.1109/mascots.2018.00016
|View full text |Cite
|
Sign up to set email alerts
|

A Robust Fault-Tolerant and Scalable Cluster-Wide Deduplication for Shared-Nothing Storage Systems

Abstract: Deduplication has been largely employed in distributed storage systems to improve space efficiency. Traditional deduplication research ignores the design specifications of shared-nothing distributed storage systems such as no central metadata bottleneck, scalability, and storage rebalancing. Further, deduplication introduces transactional changes, which are prone to errors in the event of a system failure, resulting in inconsistencies in data and deduplication metadata. In this paper, we propose a robust, faul… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
15
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 16 publications
(15 citation statements)
references
References 10 publications
0
15
0
Order By: Relevance
“…The decision to select the deduplication mode lies within the administrators' choice and the needs of their systems. In this paper, since inline deduplication provides instant storage space savings, we adopted a recent inline cluster-wide inline deduplication design proposed in [7].…”
Section: Deduplication Storage Clustermentioning
confidence: 99%
See 2 more Smart Citations
“…The decision to select the deduplication mode lies within the administrators' choice and the needs of their systems. In this paper, since inline deduplication provides instant storage space savings, we adopted a recent inline cluster-wide inline deduplication design proposed in [7].…”
Section: Deduplication Storage Clustermentioning
confidence: 99%
“…Since cluster-wide deduplication has been adopted in the general storage community [7], [8], [9], [10], [11], [12], [13], [14], it is necessary to extend its advantages to the ML and DL storage architectures without affecting performance. Deduplication improves the storage capacity by removing duplicate data across the cluster.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Khan et al [ 93 ] presented a robust, fault-tolerant, and scalable cluster-wide deduplication that was able to eliminate duplicate copies across the entire cluster. The evaluation showed great savings in disk space with minimal performance degradation, as well as great robustness in the event of sudden server failure.…”
Section: Description Of the Set Of Work By Categorymentioning
confidence: 99%
“…To meet HDFS application equipment, deduplication schemes are proposed as necessary. Khan et al [31] proposed a multilevel pattern-matching algorithm for deduplication of big data. MLPMA is implemented based on the similarity and locality of data, and the Bloom filter is applied to the deduplication cluster to achieve effective data deletion.…”
Section: Hdfs Hash Functionmentioning
confidence: 99%