2008
DOI: 10.1007/978-3-540-89965-5_26
|View full text |Cite
|
Sign up to set email alerts
|

Advances and Challenges for Scalable Provenance in Stream Processing Systems

Abstract: Abstract. While data provenance is a well-studied topic in both database and workflow systems, its support within stream processing systems presents a new set of challenges. Part of the challenge is the high stream event rate and the low processing latency requirements imposed by many streaming applications. For example, emerging streaming applications in healthcare or finance call for data provenance, as illustrated in the Century stream processing infrastructure that we are building for supporting online hea… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
23
0

Year Published

2010
2010
2021
2021

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 23 publications
(23 citation statements)
references
References 13 publications
0
23
0
Order By: Relevance
“…It can be categorized with coarse-grained provenance methods that identify dependencies between streams or sets of streams [27,26], and fine-grained methods that identify dependencies among individual stream elements [23,10,18,22]. Sansrimahachai et al [23] propose the Stream Ancestor Function -a reverse mapping function to express precise dependencies between input and output stream elements (fine-grained).…”
Section: Related Workmentioning
confidence: 99%
“…It can be categorized with coarse-grained provenance methods that identify dependencies between streams or sets of streams [27,26], and fine-grained methods that identify dependencies among individual stream elements [23,10,18,22]. Sansrimahachai et al [23] propose the Stream Ancestor Function -a reverse mapping function to express precise dependencies between input and output stream elements (fine-grained).…”
Section: Related Workmentioning
confidence: 99%
“…Bertino and his associates [12,13,36] developed and further improved the frameworks that evaluate the trustworthiness of data based on data provenance. Data provenance has also been investigated in online health care analytics for a biomedical data stream system in IBM's Century [37,38]. Malaverri et al [39] introduced a provenance-based approach for evaluating data managed by E-Science applications.…”
Section: History Of Trustworthiness Frameworkmentioning
confidence: 99%
“…Further, the acquired provenance information could be made accessible in different provenance models like e.g. the Open Provenance Model (OPM) [12] or the value centric model (TVC) [13]. However these are just alternative representations of the derived provenance data, while the focus of this paper is on acquiring the provenance data rather then how to represent them.…”
Section: Related Workmentioning
confidence: 99%