2010
DOI: 10.1007/s10586-010-0135-6

DataStager: scalable data staging services for petascale applications

Abstract: Known challenges for petascale machines are that (1) the costs of I/O for high performance applications can be substantial, especially for output tasks like checkpointing, and (2) noise from I/O actions can inject undesirable delays into the runtimes of such codes on individual compute nodes. This paper introduces the flexible 'DataStager' framework for data staging and alternative services within that jointly address (1) and (2). Data staging services moving output data from compute nodes to staging or I/O no…

Cited by 123 publications (109 citation statements)
References 29 publications
“…The IBM Blue Gene series of supercomputers (Yu et al, 2006) uses independent I/O nodes in their system to handle I/O requests, which are generated in computer nodes and forwarded to I/O nodes. DataStager, designed by Abbasi et al (2009), is a data staging service that provide asynchronous data extraction for ADIOS. ADIOS provides a simple function and an external XML file to configure the data structure and I/O methods.…”
Section: Related Work
Citation type: mentioning
confidence: 99%
“…If there is no dependence between data requested in the I/O call and the execution following it, I/O time could be hidden behind the computation time. For data-intensive scientific applications, researchers developed in-situ [10] execution or DataStager [9], where the input/output data can be asynchronously loaded/unloaded into/out of compute nodes or a staging area. Applying data asynchronous access usually requires programmers' or users' involvements.…”
Section: A Reducing Disk Latency
Citation type: mentioning
confidence: 99%
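
The statement above notes that I/O time can be hidden behind computation when later execution does not depend on the I/O call. A minimal Python sketch of that general idea follows. It is not DataStager's or ADIOS's API: the function name `run_with_async_output`, the file naming, and the toy computation are all hypothetical, and a background thread stands in for a staging area.

```python
import os
import queue
import threading


def run_with_async_output(steps, out_dir):
    """Compute `steps` results while a background thread writes them out.

    Hypothetical illustration only: the queue plays the role of a staging
    buffer, and the writer thread plays the role of a staging service.
    """
    pending = queue.Queue()

    def writer():
        # Drain queued output until the None sentinel arrives.
        while True:
            item = pending.get()
            if item is None:
                break
            step, payload = item
            path = os.path.join(out_dir, f"step_{step}.txt")
            with open(path, "w") as f:
                f.write(payload)

    thread = threading.Thread(target=writer)
    thread.start()

    results = []
    for step in range(steps):
        # Stand-in for real computation; it does not depend on earlier writes.
        value = sum(i * i for i in range(step + 1))
        results.append(value)
        # Hand the output off and continue computing without waiting for disk.
        pending.put((step, str(value)))

    pending.put(None)  # signal that no more output is coming
    thread.join()      # wait for outstanding writes before returning
    return results
```

Because the compute loop only enqueues data, it blocks on disk once at the end (`thread.join()`) rather than at every step. That is the overlap the quoted passage describes, and it holds only while no step needs to read back what was written.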
“…However, ADIOS does support several staging methods (e.g., DataStager [39] and DataSpaces [40]), which provide in memory data to the application as an external component. Interval-IO differs from ADIOS most significantly in its support of the MPI-IO interface, which, as noted above, is one of the key goals of this research.…”
Section: Related Work
Citation type: mentioning
confidence: 99%