2018
DOI: 10.1007/978-3-319-98379-0_1
|View full text |Cite
|
Sign up to set email alerts
|

Provenance Annotation and Analysis to Support Process Re-computation

Abstract: Many resource-intensive analytics processes evolve over time following new versions of the reference datasets and software dependencies they use. We focus on scenarios in which any version change has the potential to affect many outcomes, as is the case for instance in high throughput genomics where the same process is used to analyse large cohorts of patient genomes, or cases. As any version change is unlikely to affect the entire population, an efficient strategy for restoring the currency of the outcomes re… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 12 publications
0
2
0
Order By: Relevance
“…However, these tools require adapting existing user workflows and work practices to a specific software ecosystem, which is a challenge for scientific ecosystems. Also, they do not extensively capture the human elements of a workflow (e.g., lab notebooks), and are primarily limited to simple user annotations [11], [12]. Science Capsule collects workflow runtime and provenance information in a non-intrusive manner through system-level monitoring, while allowing users to enhance the information by managing userdefined artifacts as part of the workflow.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…However, these tools require adapting existing user workflows and work practices to a specific software ecosystem, which is a challenge for scientific ecosystems. Also, they do not extensively capture the human elements of a workflow (e.g., lab notebooks), and are primarily limited to simple user annotations [11], [12]. Science Capsule collects workflow runtime and provenance information in a non-intrusive manner through system-level monitoring, while allowing users to enhance the information by managing userdefined artifacts as part of the workflow.…”
Section: Related Workmentioning
confidence: 99%
“…This is challenging for workflows using experimental and observational data like that from the light sources, where researchers reuse workflows for real-time data analyses to refine the experiment and for post-experiment data analyses. Previous work in recording human activities in provenance is also limited to simple annotations [11], [12] that is not sufficient to capture the scientific complexities and thought process behind an experiment design or a workflow. Thus, there is a need to capture the relevant context about a workflow and its artifacts along with runtime information; that can seamlessly integrate with existing scientific software ecosystems to enable sharing, reuse and reproduction of workflows and their data.…”
Section: Introductionmentioning
confidence: 99%