2016 IEEE 12th International Conference on E-Science (E-Science) 2016
DOI: 10.1109/escience.2016.7870919
|View full text |Cite
|
Sign up to set email alerts
|

Crossing analytics systems: A case for integrated provenance in data lakes

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
59
0
1

Year Published

2018
2018
2021
2021

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 33 publications
(60 citation statements)
references
References 13 publications
0
59
0
1
Order By: Relevance
“…The integration of this information into the metadata system makes it possible to understand and explain possible inconsistencies in the data [3]. It can also be used to manage sensitive data, by detecting intrusions [26].…”
Section: Expected Featuresmentioning
confidence: 99%
See 2 more Smart Citations
“…The integration of this information into the metadata system makes it possible to understand and explain possible inconsistencies in the data [3]. It can also be used to manage sensitive data, by detecting intrusions [26].…”
Section: Expected Featuresmentioning
confidence: 99%
“…Thus, such features are often integrated together in a provenance tracking module [12,13,27]. Yet, we still consider that they remain different features since they are not systematically proposed together [3,5,26].…”
Section: Expected Featuresmentioning
confidence: 99%
See 1 more Smart Citation
“…This is known as schema-on-read or late binding (Fang, 2015;Miloslavskaya and Tolstoy, 2016). However, with big data volume and velocity coming into play, the absence of an explicit schema can quickly turn a data lake into an inoperable data swamp (Suriarachchi and Plale, 2016). Therefore, metadata management is a crucial component in data lakes (Quix et al, 2016).…”
Section: Introductionmentioning
confidence: 99%
“…For systems that generate heterogeneous provenance data, such as in data lakes [44,45], standard representations can help reduce the complexity of integrating such provenance into a common model [46]. ABM relationships captured by ProvMASS are specified with concepts from the W3C PROV [47].…”
Section: Provenance-related Propertiesmentioning
confidence: 99%