2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing 2010
DOI: 10.1109/ccgrid.2010.116
|View full text |Cite
|
Sign up to set email alerts
|

Streamflow Programming Model for Data Streaming in Scientific Workflows

Abstract: Geo-sciences involve large-scale parallel models, high resolution real time data from highly asynchronous and heterogeneous sensor networks and instruments, and complex analysis and visualization tools. Scientific workflows are an accepted approach to executing sequences of tasks on scientists' behalf during scientific investigation. Many geo-science workflows have the need to interact with sensors that produce large continuous streams of data, but programming models provided by scientific workflows are not eq… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
13
0

Year Published

2011
2011
2017
2017

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 16 publications
(13 citation statements)
references
References 20 publications
0
13
0
Order By: Relevance
“…In [11], authors propose a framework to compensate for the impedance mismatch between scientific workflows and continuous data streams. They also propose workflow semantics to incorporate stream in scientific workflows.…”
Section: Related Workmentioning
confidence: 99%
“…In [11], authors propose a framework to compensate for the impedance mismatch between scientific workflows and continuous data streams. They also propose workflow semantics to incorporate stream in scientific workflows.…”
Section: Related Workmentioning
confidence: 99%
“…Of course, a naïve implementation involves computing the cumulative sum by simply adding each element of the input to the previous element of the output. Such an approach would have a runtime of N. However, a more efficient approach has existed since the development of APL if not for longer 16 . To ensure that the reader has some intuition as to how this could be possible, the key idea is to exploit the partial sums that are calculated in an adder tree and to express each element of the cumulative sum as a sum of these (efficiently calculated) partial sums.…”
Section: Cumulative Summentioning
confidence: 99%
“…These models include MapReduce [15], Stream Processing [11,12,16] and Query-based techniques [17,18]. Here, we focus on one such programming model, MapReduce.…”
Section: Big Data Frameworkmentioning
confidence: 99%
See 1 more Smart Citation
“…They have the total control over a dedicated optical network so that a deterministic path can be obtained with advance reservation or on demand, while our work uses a TCP/IP based shared network, such that bandwidth is more of an issue. Streamflow [40] proposes a framework that enhances a traditional scientific workflow infrastructure to enable the stream processing capability, and resource allocation is not discussed. In [41], a distributed and pipelined dataflow execution system is proposed.…”
Section: E Performance Comparisonmentioning
confidence: 99%