Optimizing the Block I/O Subsystem for Fast Storage Devices

Yu, Young Jin; Shin, Dong In; Shin, Woong; Song, Nae Young; Choi, Jae Woo; Kim, Hyeong‐Seog; Eom, Hyeonsang; Yeom, Heon Y.

doi:10.1145/2619092

Cited by 35 publications

(16 citation statements)

References 48 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Several works propose I/O stack optimizations to take advantage of fast NVMe devices [5,59,67]. Recently, Intel introduced SPDK [24]: a set of tools and libraries for writing high-performance user-mode storage applications that reduce kernel context switches and eliminate interrupt handling overheads.…”

Section: Related Workmentioning

confidence: 99%

Performance Characterization of NVMe-over-Fabrics Storage Disaggregation

Guz

Shayesteh

et al. 2018

ACM Trans. Storage

View full text Add to dashboard Cite

Storage disaggregation separates compute and storage to different nodes to allow for independent resource scaling and, thus, better hardware resource utilization. While disaggregation of hard-drives storage is a common practice, NVMe-SSD (i.e., PCIe-based SSD) disaggregation is considered more challenging. This is because SSDs are significantly faster than hard drives, so the latency overheads (due to both network and CPU processing) as well as the extra compute cycles needed for the offloading stack become much more pronounced.In this work, we characterize the overheads of NVMe-SSD disaggregation. We show that NVMe-over-Fabrics (NVMe-oF)-a recently released remote storage protocol specification-reduces the overheads of remote access to a bare minimum, thus greatly increasing the cost-efficiency of Flash disaggregation. Specifically, while recent work showed that SSD storage disaggregation via iSCSI degrades application-level throughput by 20%, we report on negligible performance degradation with NVMe-oF-both when using stress-tests as well as with a more-realistic KV-store workload. 31:2 Z. Guz et al. Fig. 1. NVMe-over-Fabrics Architecture.40 times better than SATA-SSDs. Indeed, many large-scale cloud companies have reported using PCIe-based SSDs as part of their infrastructure [19,31,49,57].Generally, storage devices can either be co-located within the compute server nodes, or be placed in dedicated storage nodes accessed through the network. Large-scale cloud companies originally used scale-out infrastructure, composing their data centers out of commodity servers that tightly coupled memory, storage, and compute [3,38]. Unfortunately, this approach leads to inefficiencies and resource underutilization, because it fixes the ratio between compute, memory, storage, and network.Resource underutilization in data centers is a common, well-documented phenomenon [3,31,36,49]. Since the usage of different resources changes over time, predominantly independently from one another, there is no single static resource balance that fits every application that a server supports throughout its lifetime. Since changing these ratios at scale is economically unfeasible [51], server resources are often over-provisioned, leading to an increased total cost of ownership [3]. Specifically, NVMe-SSDs tend to be over-provisioned in terms of both load (IOPS) and capacity: capacity is over-provisioned to allow for future growth, and load is underutilized, because other software overheads tend to saturate the CPU well before reaching the drive limits [11,49]. Indeed, PCIe-based Flash has been argued to be "problematically fast" [64].Resource disaggregation is a common approach to mitigate the problem of over-provisioning. Specifically, storage disaggregation decouples compute and storage to different nodes (i.e., different servers), allowing independent scaling of each resource according to dynamic needs. It provides more flexibility when tuning the infrastructure to specific loads, because compute and storage can be configured for concret...

show abstract

Section: Related Workmentioning

confidence: 99%

Performance Characterization of NVMe-over-Fabrics Storage Disaggregation

Guz

Shayesteh

et al. 2018

ACM Trans. Storage

View full text Add to dashboard Cite

show abstract

“…A number of studies have provided proof of the I/O software stack being the major performance bottleneck in future storage systems. Yu et al (Yu et al 2014) analyzed system software overheads and propose six optimizations that enable operating systems to fully exploit the performance characteristics of storage devices based on non-volatile media. They proposed using polling over interrupts, bypassing the I/O scheduler for certain types of requests, and using an asynchronous I/O path, among other optimizations.…”

Section: Software Stack Bottlenecks and Remediesmentioning

confidence: 99%

Performance analysis of NVMe SSDs and their implication on real world databases

Siyamwala

Ghosh

et al. 2015

Proceedings of the 8th ACM International Systems and Storage Conference

113

View full text Add to dashboard Cite

The storage subsystem has undergone tremendous innovation in order to keep up with the ever-increasing demand for throughput. Non Volatile Memory Express (NVMe) based solid state devices are the latest development in this domain, delivering unprecedented performance in terms of latency and peak bandwidth. NVMe drives are expected to be particularly beneficial for I/O intensive applications, with databases being one of the prominent use-cases.This paper provides the first, in-depth performance analysis of NVMe drives. Combining driver instrumentation with system monitoring tools, we present a breakdown of access times for I/O requests throughout the entire system. Furthermore, we present a detailed, quantitative analysis of all the factors contributing to the low-latency, high-throughput characteristics of NVMe drives, including the system software stack. Lastly, we characterize the performance of multiple cloud databases (both relational and NoSQL) on stateof-the-art NVMe drives, and compare that to their performance on enterprise-class SATA-based SSDs. We show that NVMe-backed database applications deliver up to 8× superior client-side performance over enterprise-class, SATAbased SSDs.

show abstract

“…As the hardware latency constantly decreases, many studies have been conducted to diminish the storage stack latency, along the way. There are a tremendous number of studies that strive to reduce the kernel overhead by eliminating unnecessary context processing [67,68,69], employing a polling mechanism instead of interrupts [70,71,67,69], and performance isolation [72,73]. Shin et al [67] present a low-level hardware abstraction layer interface which curtails scheduling delays caused by extra contexts to optimize the I/O path.…”

Section: I/o Stack In Vm Hypervisorsmentioning

confidence: 99%

“…Shin et al [67] present a low-level hardware abstraction layer interface which curtails scheduling delays caused by extra contexts to optimize the I/O path. Yu et al [68] demonstrate six optimization schemes to fully utilize high performance introduced by fast storage devices. The proposed schemes in [68] relies on a hardware support to expand parallelism inside SSDs.…”

Section: I/o Stack In Vm Hypervisorsmentioning

confidence: 99%

See 1 more Smart Citation

Flash-based storage management in cloud computing datacenter infrastructure

Yang

View full text Add to dashboard Cite

I would like to express my sincere gratitude to the following people. It is impossible for me to complete my dissertation without the support of them. Firstly, I would like to thank my advisor, Prof. Ningfang Mi, for her continuous guidance and encouragement throughout my Ph.D. life with her patience, motivation, enthusiasm, and immense knowledge. I thank her for introducing me to the wonders of scientific research and providing me extensive personal and professional guidance. I could not have imagined having a better advisor and mentor for my Ph.D. study. Secondly, I would like to offer my special thanks my Ph.D. defense committee members, Prof. Yunsi Fei and Prof. Straitis Ioannidis, for their great support and insightful feedback and comments on my dissertation. Thirdly, my thanks also go to my labmates Ms.

show abstract

Optimizing the Block I/O Subsystem for Fast Storage Devices

Cited by 35 publications

References 48 publications

Performance Characterization of NVMe-over-Fabrics Storage Disaggregation

Performance Characterization of NVMe-over-Fabrics Storage Disaggregation

Performance analysis of NVMe SSDs and their implication on real world databases

Flash-based storage management in cloud computing datacenter infrastructure

Contact Info

Product

Resources

About