2019
DOI: 10.1007/s42514-019-00005-9
|View full text |Cite
|
Sign up to set email alerts
|

Software-defined QoS for I/O in exascale computing

Abstract: Supercomputers' capability is approaching the exascale level, which enables large computing systems to run more jobs concurrently. Since modern data-intensive scientific applications can sometimes produce millions of I/O requests per second, I/O systems always suffer from heavy workloads and impede the overall performance. How to allocate I/O resources and guarantee the QoS (Quality of Service) for each individual application is becoming an increasingly important question. In this paper, we propose SDQoS, a so… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
4
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 8 publications
(4 citation statements)
references
References 20 publications
0
4
0
Order By: Relevance
“…At each time-step, the progress of each application is monitored as the number of I/O transfers that have been granted so far. In a related paper [11], the authors survey I/O capabilities of state-of-the-art supercomputers and enforce QoS constraints for I/O transfers by implementing a token-based bucket algorithm that works similarly to that of [24]. Finally, the authors of [23] target a system with several I/O sub-systems (OST, which stands for Object Storage Target, typically a RAID array of disks).…”
Section: I/o-copmentioning
confidence: 99%
“…At each time-step, the progress of each application is monitored as the number of I/O transfers that have been granted so far. In a related paper [11], the authors survey I/O capabilities of state-of-the-art supercomputers and enforce QoS constraints for I/O transfers by implementing a token-based bucket algorithm that works similarly to that of [24]. Finally, the authors of [23] target a system with several I/O sub-systems (OST, which stands for Object Storage Target, typically a RAID array of disks).…”
Section: I/o-copmentioning
confidence: 99%
“…PADLL is able to control the rate of both data and metadata workflows. Other systems are directly implemented within core layers of the HPC I/O stack, including the PFS [14], [18], [20], [22], [23], scheduler [21], and I/O libraries [16], [17]. These solutions are intrusive and offer limited maintainability and portability.…”
Section: Related Workmentioning
confidence: 99%
“…While there are numerous solutions to assess the bottlenecks generated from data workflows in HPC clusters [13], [14], [16]- [23], the metadata counterpart has not received the same level of attention, and existing approaches are suboptimal. * Corresponding author: Ricardo Macedo (ricardo.g.macedo@inesctec.pt).…”
Section: Introductionmentioning
confidence: 99%
“…System overheadQoS guarantee has been extensively studied in storage systems and implemented in different ways. Hua et al22 proposed a software-defined QoS framework that using token-bucket mechanisms for bandwidth control and for guaranteeing applications' I/O requirements. The key idea of this work is to integrate software-defined components into storage systems and provide a fine-grained QoS.…”
mentioning
confidence: 99%