2020
DOI: 10.1007/s11390-020-9801-1
|View full text |Cite
|
Sign up to set email alerts
|

Ad Hoc File Systems for High-Performance Computing

Abstract: Storage backends of parallel compute clusters are still based mostly on magnetic disks, while newer and faster storage technologies such as flash-based SSDs or non-volatile random access memory (NVRAM) are deployed within compute nodes. Including these new storage technologies into scientific workflows is unfortunately today a mostly manual task, and most scientists therefore do not take advantage of the faster storage media. One approach to systematically include node-local SSDs or NVRAMs into scientific work… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
14
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 29 publications
(14 citation statements)
references
References 66 publications
0
14
0
Order By: Relevance
“…While emerging storage technologies [39]- [41] that aim to elevate I/O bottlenecks, and projects such as SAGE [42] and DAOS [43], are being intensively studied, I/O will likely remain a challenging aspect when deploying ML workload. By enabling fine-grained profiling and tracing capability, we also enable the opportunity for automated decision making and auto-tuning in the future.…”
Section: Discussionmentioning
confidence: 99%
“…While emerging storage technologies [39]- [41] that aim to elevate I/O bottlenecks, and projects such as SAGE [42] and DAOS [43], are being intensively studied, I/O will likely remain a challenging aspect when deploying ML workload. By enabling fine-grained profiling and tracing capability, we also enable the opportunity for automated decision making and auto-tuning in the future.…”
Section: Discussionmentioning
confidence: 99%
“…This requires either an application-specific data management implementation, or the addition of a system software layer that manages the data and presents it to the application. One such software layer is GekkoFS [7,8] , an ephemeral file system that is created just-in-time for individual jobs, and that sits on top of the B-APM working across nodes. GekkoFS is ephemeral because it is only live for the duration of a job, and it therefore also needs the ability to move the data into, and out of, B-APM.…”
Section: B-apm As a Distributed File Systemmentioning
confidence: 99%
“…IOR benchmark documentation, https://ior.readthedocs.io8 The IO500 list, https://www.vi4io.org/io500/start…”
mentioning
confidence: 99%
“…A relevant discussion and in-depth analysis on ephemeral systems has already been done by Brinkmann et al [5]; that article discusses the general ideas of ad-hoc file systems as well as the specific characteristics of three implementations: BeeOND [13], GekkoFS [37], and BurstFS [39].…”
Section: Ephemeral Systemsmentioning
confidence: 99%
“…There is a family of storage systems that tackle the interference issues and some of the aforementioned design challenges with a distinct approach: ephemeral storage systems, such as ad-hoc filesystems [5]. Ephemeral systems are designed to be executed next to the computation, as opposed to the more traditional approach of independent storage nodes (shared by multiple applications).…”
Section: Introductionmentioning
confidence: 99%