Proceedings of the 1st ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies 2012
DOI: 10.1145/2443416.2443418
|View full text |Cite
|
Sign up to set email alerts
|

Evaluating parameter sweep workflows in high performance computing

Abstract: Scientific experiments based on computer simulations can be defined, executed and monitored using Scientific Workflow Management Systems (SWfMS). Several SWfMS are available, each with a different goal and a different engine. Due to the exploratory analysis, scientists need to run parameter sweep (PS) workflows, which are workflows that are invoked repeatedly using different input data. These workflows generate a large amount of tasks that are submitted to High Performance Computing (HPC) environments. Differe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
4
2

Relationship

3
3

Authors

Journals

citations
Cited by 9 publications
(8 citation statements)
references
References 37 publications
0
8
0
Order By: Relevance
“…• Workload analysis for raw data extraction (named as WORKLOAD): Evaluation of the raw data extraction cost using the Oil and Gas dataflow with different workloads. In this situation, we consider two analyses that evaluate different execution times for data transformation following a gamma distribution [30], and dataflow execution varying the size of raw data files. • Query processing (named as QUERY): Analyses of the raw data query-processing support.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…• Workload analysis for raw data extraction (named as WORKLOAD): Evaluation of the raw data extraction cost using the Oil and Gas dataflow with different workloads. In this situation, we consider two analyses that evaluate different execution times for data transformation following a gamma distribution [30], and dataflow execution varying the size of raw data files. • Query processing (named as QUERY): Analyses of the raw data query-processing support.…”
Section: Resultsmentioning
confidence: 99%
“…The extraction cost considers the elapsed time to process a raw data file, extract the specific information, and store this data in the provenance database. Workload analysis for raw data extraction (named as WORKLOAD): Evaluation of the raw data extraction cost using the Oil and Gas dataflow with different workloads. In this situation, we consider two analyses that evaluate different execution times for data transformation following a gamma distribution , and dataflow execution varying the size of raw data files. Query processing (named as QUERY): Analyses of the raw data query‐processing support. We present examples to analyze domain‐specific file contents, multiple files related by simulation programs, and specific related elements from multiple raw data files.…”
Section: Experimental Evaluationmentioning
confidence: 99%
“…Workflow-oriented motifs may correspond to remote invocations, repetitive activities, parameter sweep workflows and meta-workflows [12,57]. A parameter sweep workflow is a workflow with multiple input parameter sets, which needs to be executed for each input parameter set [25,64]. A meta-workflow is a workflow composed of sub-workflows.…”
Section: Scientific Workflow Examplesmentioning
confidence: 99%
“…The multiple objectives are mainly attained by adjusting workflow scheduling at the WEP execution layer. Having an algebra and dataflow-oriented execution engine opens up interesting opportunities for optimization [45,25]. For example, it allows for user interference on the execution plan, even during the execution.…”
Section: Wep Generation Layermentioning
confidence: 99%
“…Workflow activities execute programs that consume and produce data (parameters values and files). An output data produced by an activity can be consumed as input data to another activity, establishing a dependency relation between those activities [12].…”
Section: Workflow Based Platformmentioning
confidence: 99%