Enabling Efficient Updates in KV Storage via Hashing

Chan, Helen H. W.; Lee, Patrick P. C.; Xu, Yinlong

doi:10.1145/3340287

Cited by 12 publications

(3 citation statements)

References 30 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…HashKV [4] introduces a hash-based Value Store layout, hashing the key to determine its group and appending the value to the group's end. While HashKV improves garbage collection performance compared to WiscKey, it does not significantly change range query performance.…”

Section: Garbage Collection and Range Query In Key-value Separated St...mentioning

confidence: 99%

TreeKV: efficient garbage collection and range query for key-value separated LSM-stores

Luan

2024

Third International Symposium on Computer Applications and Information Systems (ISCAIS 2024)

View full text Add to dashboard Cite

LSM-tree is favored in write-intensive scenarios due to its sequential write feature, but it grapples with read and write amplification issues. A common solution is to store keys and values separately, with the LSM-tree holding only the key and value address, while the value is kept separately. However, this approach presents challenges in range query and garbage collection, necessitating multiple random reads and pointless data rewriting. To address these issues, we introduce TreeKV, a system that uses a specialized B+ tree for value storage, grouping values with adjacent keys together. This reduces random reads during range queries and eliminates unnecessary data rewriting during garbage collection. We also delve into crash recovery and minimal-blocking garbage collection implementation in TreeKV. Our comparison with existing key-value separated stores shows that TreeKV, while maintaining high read-write performance, enhances range query and garbage collection performance.

show abstract

Section: Garbage Collection and Range Query In Key-value Separated St...mentioning

confidence: 99%

TreeKV: efficient garbage collection and range query for key-value separated LSM-stores

Luan

2024

Third International Symposium on Computer Applications and Information Systems (ISCAIS 2024)

View full text Add to dashboard Cite

show abstract

“…7 Related Work In addition to FTL in SSD controllers, log structured stores have been widely used in various data management components and systems. Examples include keyvalue stores [11,12], NoSQL systems [1,25], in-memory stores [19,20], log structured variants of B + -trees [13,22], and the key-value separation design [5,14,16] for LSM-trees [17,18]. In these systems, cleaning is often the new bottleneck because of the expensive write amplification incurred by the cleaning process.…”

Section: Tpc-c Workloadsmentioning

confidence: 99%

Efficiently Reclaiming Space in a Log Structured Store

Lomet

Luo

2020

Preprint

View full text Add to dashboard Cite

A log structured store uses a single write I/O for a number of diverse and non-contiguous pages within a large buffer instead of using a write I/O for each page separately. This requires that pages be relocated on every write, because pages are never updated in place. Instead, pages are dynamically remapped on every write. Log structuring was invented for and used initially in file systems. Today, a form of log structuring is used in SSD controllers because an SSD requires the erasure of a large block of pages before flash storage can be reused. No update-in-place requires that the storage for out-of-date pages be reclaimed (garbage collected or "cleaned"). We analyze cleaning performance and introduce a cleaning strategy that uses a new way to prioritize the order in which stale pages are garbage collected. Our cleaning strategy approximates an "optimal cleaning strategy". Simulation studies confirm the results of the analysis. This strategy is a significant improvement over previous cleaning strategies.

show abstract

“…Improving read performance requires writing additional information, such as indexes, which can result in write amplification [3], [4], [5] and space amplification [6]. LSM-tree performance is primarily enhanced by optimizing the timing of compression strategies [7], [8], [9], employing key-value separation [10], [11], and utilizing automatic tuning [12], [13], [14]. To mitigate database performance fluctuations caused by compression and merging operations, the timing of compression strategies can be optimized by reducing these operations under high-load conditions, given their significant disk input and output requirements.…”

Section: Introductionmentioning

confidence: 99%

TATune: A RocksDB Knob Tuning System Based on Transformer

Hu,

Wang

2023

IEEE Access

View full text Add to dashboard Cite

RocksDB is a powerful database engine that offers a wide range of adjustable knobs, which greatly influence its performance. However, configuring RocksDB manually for optimal performance is challenging due to the large number of available knobs and their complex settings. To address this issue, we propose Transformer Adaptive Gentic Algorithm Tune(TATune), an auto-tuning system for RocksDB knobs. In TATune, knob configuration files for RocksDB are randomly generated and executed at different preset workloads first. Subsequently, the correlation between the knob and RocksDB performance is learned by the prediction model based on Transformer. Finally, an adaptive genetic algorithm that utilizes the prediction model as a fitness function to recommend the RocksDB knob setting. Additionally, a novel optimization metric is also proposed to evaluate the performance of the auto-tuning RocksDB knob system. TATune is compared with other approaches to configure RockDB knobs on six distinct workloads. The results indicate that TATune is effective and achieves significant performance improvement across various target workloads. The final average optimization performance is 26% better than K2vTune and 72% better than RTune.

show abstract

Enabling Efficient Updates in KV Storage via Hashing

Cited by 12 publications

References 30 publications

TreeKV: efficient garbage collection and range query for key-value separated LSM-stores

TreeKV: efficient garbage collection and range query for key-value separated LSM-stores

Efficiently Reclaiming Space in a Log Structured Store

TATune: A RocksDB Knob Tuning System Based on Transformer

Contact Info

Product

Resources

About