2016 IEEE 36th International Conference on Distributed Computing Systems Workshops (ICDCSW), 2016
DOI: 10.1109/icdcsw.2016.17

Hadoop Based Scalable Cluster Deduplication for Big Data

Cited by 8 publications
(2 citation statements)
References 4 publications
“…Deduplication [10] Unique data storage is done using HDFS; Map Reduce is used to realize parallel deduplication processing.…”
Section: Chunk Level (mentioning)
confidence: 99%
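
The statement above describes MapReduce-style parallel deduplication only in outline. As a rough illustration of the idea, the sketch below simulates the map, shuffle, and reduce phases in a single Python process. It does not use Hadoop or HDFS and is not the cited paper's implementation; the function names (`map_phase`, `shuffle`, `reduce_phase`, `deduplicate`) and the choice of SHA-256 are assumptions made for this example.

```python
import hashlib
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(chunks: Iterable[bytes]) -> Iterator[Tuple[str, bytes]]:
    """Map: emit a (fingerprint, chunk) pair for every input chunk."""
    for chunk in chunks:
        # SHA-256 is an assumed fingerprint function for this sketch.
        yield hashlib.sha256(chunk).hexdigest(), chunk


def shuffle(pairs: Iterable[Tuple[str, bytes]]) -> Dict[str, List[bytes]]:
    """Shuffle: group all chunks that share the same fingerprint."""
    groups: Dict[str, List[bytes]] = defaultdict(list)
    for fp, chunk in pairs:
        groups[fp].append(chunk)
    return groups


def reduce_phase(groups: Dict[str, List[bytes]]) -> Dict[str, bytes]:
    """Reduce: keep a single copy of the chunk for each fingerprint."""
    return {fp: copies[0] for fp, copies in groups.items()}


def deduplicate(chunks: Iterable[bytes]) -> Dict[str, bytes]:
    """Run the three simulated phases and return the unique chunks."""
    return reduce_phase(shuffle(map_phase(chunks)))
```

In a real cluster the map and reduce functions would run on separate nodes and the unique chunks would be written to distributed storage; here everything stays in memory so the data flow is easy to follow.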
“…Data deduplication technology generally divides files into chunks and calculates the fingerprint of each data chunk by a hash function. The fingerprints are compared to detect whether two data chunks are redundant or not [6]. Deduplication technology keeps only one duplicate to reduce the amount of data to save storage space during the data storage process [7], [8].…”
Section: Introduction (mentioning)
confidence: 99%
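
The chunk-and-fingerprint process described in the statement above can be sketched as follows. This is a minimal illustration under stated assumptions, not the method of any cited paper: it uses fixed-size chunking, SHA-256 fingerprints, and an in-memory dictionary as the chunk store, and the names `ChunkStore`, `put_file`, `get_file`, and `CHUNK_SIZE` are hypothetical.

```python
import hashlib
from typing import Dict, List

CHUNK_SIZE = 4  # bytes; deliberately tiny for illustration (assumption)


def split_into_chunks(data: bytes, size: int = CHUNK_SIZE) -> List[bytes]:
    """Divide a file's bytes into fixed-size chunks (last one may be shorter)."""
    return [data[i:i + size] for i in range(0, len(data), size)]


def fingerprint(chunk: bytes) -> str:
    """Compute the chunk fingerprint with a hash function (SHA-256 assumed)."""
    return hashlib.sha256(chunk).hexdigest()


class ChunkStore:
    """In-memory store that keeps one copy of each distinct chunk."""

    def __init__(self) -> None:
        self.chunks: Dict[str, bytes] = {}

    def put_file(self, data: bytes) -> List[str]:
        """Store a file and return its recipe (ordered list of fingerprints)."""
        recipe = []
        for chunk in split_into_chunks(data):
            fp = fingerprint(chunk)
            # Comparing fingerprints detects redundancy: store unseen chunks only.
            if fp not in self.chunks:
                self.chunks[fp] = chunk
            recipe.append(fp)
        return recipe

    def get_file(self, recipe: List[str]) -> bytes:
        """Rebuild a file from its recipe."""
        return b"".join(self.chunks[fp] for fp in recipe)
```

A short usage example: storing `b"AAAABBBBAAAACCCC"` produces a four-entry recipe, but only three chunks are kept because the first and third chunks are identical.

```python
store = ChunkStore()
recipe = store.put_file(b"AAAABBBBAAAACCCC")
assert store.get_file(recipe) == b"AAAABBBBAAAACCCC"
```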