2010
DOI: 10.1145/1837915.1837921

I/O Deduplication

Abstract: Duplication of data in storage systems is becoming increasingly common. We introduce I/O Deduplication, a storage optimization that utilizes content similarity for improving I/O performance by eliminating I/O operations and reducing the mechanical delays during I/O operations. I/O Deduplication consists of three main techniques: content-based caching, dynamic replica retrieval, and selective duplication. Each of these techniques is motivated by our observations with I/O workload traces obtained from actively-u…
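
As an illustration of the first technique named in the abstract, content-based caching, the sketch below indexes a read cache by a digest of the block content rather than by block address, so two addresses holding identical data share one cache entry. This is only a minimal sketch under stated assumptions, not the authors' implementation: the class name `ContentCache`, the use of SHA-256, the LRU eviction policy, and the dict standing in for the disk are all hypothetical choices made for the example.

```python
import hashlib
from collections import OrderedDict


class ContentCache:
    """Toy read cache keyed by content digest instead of block address (hypothetical)."""

    def __init__(self, capacity, backend):
        self.capacity = capacity        # max number of distinct contents kept in memory
        self.backend = backend          # dict: block address -> bytes (stands in for the disk)
        self.addr_to_digest = {}        # block address -> digest of its last known content
        self.by_digest = OrderedDict()  # digest -> bytes, oldest entry first (LRU order)
        self.disk_reads = 0             # counts reads that had to go to the backend

    def _remember(self, addr, data):
        # Assumption: SHA-256 is used as the content fingerprint.
        digest = hashlib.sha256(data).hexdigest()
        self.addr_to_digest[addr] = digest
        self.by_digest[digest] = data
        self.by_digest.move_to_end(digest)
        while len(self.by_digest) > self.capacity:
            self.by_digest.popitem(last=False)  # evict the least recently used content

    def write(self, addr, data):
        self.backend[addr] = data
        self._remember(addr, data)

    def read(self, addr):
        digest = self.addr_to_digest.get(addr)
        if digest in self.by_digest:
            # Hit: some address with identical content is already cached.
            self.by_digest.move_to_end(digest)
            return self.by_digest[digest]
        self.disk_reads += 1
        data = self.backend[addr]
        self._remember(addr, data)
        return data


cache = ContentCache(capacity=1, backend={})
cache.write(0, b"X")
cache.write(1, b"X")   # same content as address 0
cache.write(2, b"Y")   # evicts the cached copy of b"X"
cache.read(0)          # miss: goes to the backend
cache.read(1)          # hit: address 1 shares its content with address 0
```

With a single cache slot, the read of address 1 is served from memory even though only address 0 was fetched from the backend. An address-indexed cache of the same size would have missed on it.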

Cited by 173 publications (12 citation statements)
References: 19 publications
“…We select three data sets to test our model, which are MSR [35], VDI‐LUN [36], and KC. MSR data set contains 1‐week block I/O traces of enterprise servers on 13 servers at Microsoft Research Cambridge.…”
Section: Methods
Citation type: mentioning
confidence: 99%
“…MSNfs is the I/O trace gathered in the MSN Storage back-end file server [Kavalanekar et al 2008]. The Homes workload is collected from the home directory in an NFS server, which consists of several research group activities such as developing, testing, technical writing, and plotting [Koller and Rangaswami 2010]. The remaining three workloads, FileZilla, Torrent, and GIMP, are from in-house traces [ESOSLab 2012].…”
Section: Case Studies
Citation type: mentioning
confidence: 99%
“…They propose that there should be an interface to inform the SSD of its Power Budget and that the firmware of an SSD should be designed to dynamically adjust the parallelism degree subject to its Power Budget. Several works have proposed dynamically throttling the transfer rate or parallelism degree of an SSD to regulate the temperature of the SSD [Park et al 2009b; Lee et al 2013]. To properly design the SSD internal parallelism, we need to incorporate the channel switch delay, way switch delay, and page write latency.…”
Section: Peak Energy Consumption
Citation type: mentioning
confidence: 99%
“…We implement Ditto and evaluate its performance with both synthesized and real-world workloads [37,67,83]. Ditto is more elastic than Redis regarding resource efficiency and the speed of resource adjustments.…”
Section: Introduction
Citation type: mentioning
confidence: 99%