2015
DOI: 10.1007/978-3-662-46641-4_4

Representing Interoperable Provenance Descriptions for ETL Workflows

Abstract: The increasing availability of data on the Web, provided by the emergence of Web 2.0 applications and, more recently, by Linked Data, has brought additional complexity to data management tasks, as the number of available data sources and their associated heterogeneity drastically increases. In this scenario, where data is reused and repurposed on a new scale, the pattern expressed as Extract-Transform-Load (ETL) emerges as a fundamental and recurrent process for both producers and consumers of data on t…

Cited by 5 publications (5 citation statements)
References 12 publications
“…Provenance tracking for web data has been concretely explored by Freitas et al. (2011). In this study, Prov4J (Freitas et al., 2010), a framework for constructing a provenance system for web data utilizing semantic web technologies, has been presented. Prov4J utilizes the Resource Description Framework (RDF) to represent provenance information and URIs (called ProvURIs) to associate the information resource in an application with its provenance descriptor stored in a provenance repository.…”
Section: Related Work (mentioning)
confidence: 99%
“…Provenance is not only used for describing the origin of result data, but also for explaining the process of data aggregation, assessing the quality of data and examining the execution of web-based data access. Several studies in this context (Hartig, 2009; Freitas et al., 2010) utilize provenance data models that are compatible with the community standard, the Open Provenance Model (OPM) (Moreau et al., 2011). However, because of flexibility in representing data and the support of interoperability in various provenance systems (Eckert et al., 2014), W3C PROV (Gil & Miles, 2016), a new worldwide provenance standard, is utilized as a data model for our provenance solution.…”
Section: Related Work (mentioning)
confidence: 99%
“…Community efforts towards the convergence into a common provenance model led to the Open Provenance Model (OPM). OPM descriptions allow interoperability on the level of workflow structure.…”
Section: Provenance Model (mentioning)
confidence: 99%
“…The authors used data integration toolkits as a workflow framework to support their LOD publishing process and provenance gathering facilities. Later, [13] and [20] investigated complementary perspectives of such an approach. The first author described a vocabulary focused on modeling data transformation workflows.…”
Section: Provenance Initiatives (mentioning)
confidence: 99%
“…In order to represent the published linked provenance data, the Data Preparation and Transformation process adopts the semantic approach proposed by [13]. This approach was used in the provenance management architecture presented by [23] as described at the end of section 3.…”
Section: Data Preparation and Transformation Process Provenance (mentioning)
confidence: 99%