2021
DOI: 10.1016/j.jpdc.2020.10.007
|View full text |Cite
|
Sign up to set email alerts
|

An empirical study of I/O separation for burst buffers in HPC systems

Abstract: Develop an I/O separation scheme using multi-stream feature of solid state drives, which reduces garbage collection overheads, improves I/O throughput, and extends device lifetime.• Design a stream-aware scheduling policy to work with the I/O separation scheme and improve overall I/O throughput.• Improves performance without require users to change their I/O functions.• Increases I/O throughput by 44% and reduce Write Amplification by 20%.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 18 publications
(4 citation statements)
references
References 17 publications
0
4
0
Order By: Relevance
“…This becomes especially interesting for cases when the checkpoint data is not needed for analyses. To improve the total checkpoint performance, without changing to a faster checkpoint method, one could leverage burst buffer storage [17], [23], [22], if available on the cluster.…”
Section: Discussionmentioning
confidence: 99%
“…This becomes especially interesting for cases when the checkpoint data is not needed for analyses. To improve the total checkpoint performance, without changing to a faster checkpoint method, one could leverage burst buffer storage [17], [23], [22], if available on the cluster.…”
Section: Discussionmentioning
confidence: 99%
“…Although SSDs do not have such seek penalties, they do require periodic garbage collection as erase blocks become sparse, and this results in a distinct loss of bandwidth once an SSD becomes sufficiently "dirty" [15]. I/O separation has been shown to mitigate this age-related performance loss on parallel file systems [16], but such techniques require hardware support implemented in the file system, and Lustre does not yet support any such capabilities.…”
Section: B Performance Over Timementioning
confidence: 99%
“…The node runs simulations using Simple Linux Utility for Resource Management (SLURM). SLURM is a cluster workload manager for the Linux operating system [16]. This facility implements three main functions.…”
Section: A Material's Configurationsmentioning
confidence: 99%