2019
DOI: 10.1145/3340287
|View full text |Cite
|
Sign up to set email alerts
|

Enabling Efficient Updates in KV Storage via Hashing

Abstract: Persistent key-value (KV) stores mostly build on the Log-Structured Merge (LSM) tree for high write performance, yet the LSM-tree suffers from the inherently high I/O amplification. KV separation mitigates I/O amplification by storing only keys in the LSM-tree and values in separate storage. However, the current KV separation design remains inefficient under update-intensive workloads due to its high garbage collection (GC) overhead in value storage. We propose HashKV, which aims for high update performance at… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 12 publications
(3 citation statements)
references
References 30 publications
0
3
0
Order By: Relevance
“…HashKV [4] introduces a hash-based Value Store layout, hashing the key to determine its group and appending the value to the group's end. While HashKV improves garbage collection performance compared to WiscKey, it does not significantly change range query performance.…”
Section: Garbage Collection and Range Query In Key-value Separated St...mentioning
confidence: 99%
“…HashKV [4] introduces a hash-based Value Store layout, hashing the key to determine its group and appending the value to the group's end. While HashKV improves garbage collection performance compared to WiscKey, it does not significantly change range query performance.…”
Section: Garbage Collection and Range Query In Key-value Separated St...mentioning
confidence: 99%
“…7 Related Work In addition to FTL in SSD controllers, log structured stores have been widely used in various data management components and systems. Examples include keyvalue stores [11,12], NoSQL systems [1,25], in-memory stores [19,20], log structured variants of B + -trees [13,22], and the key-value separation design [5,14,16] for LSM-trees [17,18]. In these systems, cleaning is often the new bottleneck because of the expensive write amplification incurred by the cleaning process.…”
Section: Tpc-c Workloadsmentioning
confidence: 99%
“…Improving read performance requires writing additional information, such as indexes, which can result in write amplification [3], [4], [5] and space amplification [6]. LSM-tree performance is primarily enhanced by optimizing the timing of compression strategies [7], [8], [9], employing key-value separation [10], [11], and utilizing automatic tuning [12], [13], [14]. To mitigate database performance fluctuations caused by compression and merging operations, the timing of compression strategies can be optimized by reducing these operations under high-load conditions, given their significant disk input and output requirements.…”
Section: Introductionmentioning
confidence: 99%