2020
DOI: 10.1016/j.future.2017.11.028

Data reduction in scientific workflows using provenance monitoring and user steering

Abstract: Scientific workflows need to be iteratively, and often interactively, executed for large input datasets. Reducing data from input datasets is a powerful way to reduce overall execution time in such workflows. When this is accomplished online (i.e., without requiring the user to stop execution to reduce the data, and then resume), it can save much time. However, determining which subsets of the input data should be removed becomes a major problem. A related problem is to guarantee that the workflow system will …
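
The online reduction idea described in the abstract can be illustrated with a minimal sketch. This is not the paper's system: the `SteerableQueue` class, its method names, and the `value` field are hypothetical names chosen for illustration. The sketch assumes pending input items sit in a queue, that a user-supplied criterion can remove a subset of them while execution continues, and that each removal is recorded so it can be inspected later.

```python
from collections import deque
from dataclasses import dataclass, field
from typing import Callable, Deque, Dict, List


@dataclass
class SteerableQueue:
    """Hypothetical holder for pending input items and a log of removals."""
    pending: Deque[Dict[str, float]]
    removed_log: List[Dict[str, object]] = field(default_factory=list)
    processed: List[Dict[str, float]] = field(default_factory=list)

    def reduce(self, criterion: Callable[[Dict[str, float]], bool], reason: str) -> int:
        """Drop pending items matching `criterion` and record each removal."""
        kept: Deque[Dict[str, float]] = deque()
        dropped = 0
        for item in self.pending:
            if criterion(item):
                # Keep a record of what was removed and why.
                self.removed_log.append({"item": item, "reason": reason})
                dropped += 1
            else:
                kept.append(item)
        self.pending = kept
        return dropped

    def step(self) -> bool:
        """Process one pending item; return False when nothing is left."""
        if not self.pending:
            return False
        self.processed.append(self.pending.popleft())
        return True


# Ten illustrative input items; `value` stands in for any input attribute.
queue = SteerableQueue(deque({"id": i, "value": float(i)} for i in range(10)))

queue.step()  # execution has already started
queue.step()

# The user steers the run: remove remaining inputs with value > 6.0,
# without stopping and restarting the execution.
dropped = queue.reduce(lambda item: item["value"] > 6.0, "user: value > 6.0")

while queue.step():
    pass
```

Items already processed before the steering action are left untouched; only inputs still pending are eligible for removal, and the log keeps the reason alongside each removed item.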

Citations: cited by 14 publications (31 citation statements)
References: 30 publications
“…In this section, we briefly explain each of them. Although presented separately, we notice that those categories share a lot of data (Ogasawara et al, 2011; Oliveira et al, 2010; Silva et al, 2017; Souza et al, 2020). Storing them separately leads to data redundancy and lack of data integration support for runtime data analysis.…”
Section: Data Management In Large-scale Workflows (mentioning)
confidence: 99%
“…Risers Fatigue Analysis Workflow (Souza et al, 2020) is a real case study from the Oil & Gas industry. This workflow calculates the fatigue of ultra-deep oil platform structures, such as risers.…”
Section: Experimental Evaluation (mentioning)
confidence: 99%
“…All information is stored in a workflow database and thus can be changed during run time. In [26], this feature is used to reduce the data while it is already being processed. However, instead of removing, data can also be added.…”
Section: Related Work (mentioning)
confidence: 99%
“…In Davidson et al [11], a theoretical model of privacy for module functionality in workflows is developed. An adaptive workflow monitoring approach that proposes a solution to the input data reduction problem in scientific workflows is presented in Souza et al [12]. Morshedzadeh et al [13] present a PMS that is based on an industrial case study for the product lifecycle management.…”
Section: Related Work (mentioning)
confidence: 99%