Fifth IEEE/ACM International Workshop on Grid Computing
DOI: 10.1109/grid.2004.36
|View full text |Cite
|
Sign up to set email alerts
|

High Performance Threaded Data Streaming for Large Scale Simulations

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
24
0

Publication Types

Select...
6
4

Relationship

1
9

Authors

Journals

citations
Cited by 30 publications
(25 citation statements)
references
References 7 publications
1
24
0
Order By: Relevance
“…Alternatively, a number of scientific workflow systems have adopted a more expressive language for modeling scientific workflows based on dataflow process networks [19,15], a model of computation that comes with "built-in" support for stream-based and concurrent execution. 1 Dataflow is a natural paradigm for data-driven and dataintensive scientific workflows such as, e.g., the terabytesized Fusion Plasma Simulation [3] and the Terascale Supernova Initiative [31]. Workflows expressed using dataflow process networks can be efficiently analysed and scheduled [17], and are also a simple and intuitive model for workflow designers [4].…”
Section: Introductionmentioning
confidence: 99%
“…Alternatively, a number of scientific workflow systems have adopted a more expressive language for modeling scientific workflows based on dataflow process networks [19,15], a model of computation that comes with "built-in" support for stream-based and concurrent execution. 1 Dataflow is a natural paradigm for data-driven and dataintensive scientific workflows such as, e.g., the terabytesized Fusion Plasma Simulation [3] and the Terascale Supernova Initiative [31]. Workflows expressed using dataflow process networks can be efficiently analysed and scheduled [17], and are also a simple and intuitive model for workflow designers [4].…”
Section: Introductionmentioning
confidence: 99%
“…have been working in collaboration with LoCI Lab on the development of new techniques for buffering and transferring data generated by simulations running on large supercomputers at NERSC (Seaborg) and ONRL (Phoenix) to PPPL for analysis and visualization [5]. This work takes advantage of the presence of interoperable IBP depots at PPPL and at NERSC and ORNL, allowing data to be transferred directly from the compute nodes to the PPPL depots up to the speed permitted by the institutional firewall (currently limited to 100Mbps).…”
Section: Fusion Energy: Gyrokinetic Particle Simulation Of Turbulent mentioning
confidence: 99%
“…In [6], when the tasks on the workflow are executed by the same machine the input and output is connected through a pipe. The examples of data streaming between the tasks that are executed by the separate machines are Threaded Data Streaming [7] and Styx Grid Service (SGS) [8]. The former aims at transferring the terabyte-scale simulation data to local analysis visualization cluster.…”
Section: Storage Layermentioning
confidence: 99%