Proceedings of the Thirtieth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems 2011
DOI: 10.1145/1989284.1989302
|View full text |Cite
|
Sign up to set email alerts
|

Provenance for aggregate queries

Abstract: We study in this paper provenance information for queries with aggregation. Provenance information was studied in the context of various query languages that do not allow for aggregation, and recent work has suggested to capture provenance by annotating the different database tuples with elements of a commutative semiring and propagating the annotations through query evaluation. We show that aggregate queries pose novel challenges rendering this approach inapplicable. Consequently, we propose a new approach, w… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
164
0

Year Published

2013
2013
2023
2023

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 124 publications
(164 citation statements)
references
References 48 publications
0
164
0
Order By: Relevance
“…The investigation of approximate probability computation for queries with aggregates via d-trees for semiring and semimodule expressions [2] is compelling since the performance of exact computation for such queries is prohibitively low. Another promising extension of this work is to consider model bounds with respect to more expressive formula languages such as decision diagrams.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…The investigation of approximate probability computation for queries with aggregates via d-trees for semiring and semimodule expressions [2] is compelling since the performance of exact computation for such queries is prohibitively low. Another promising extension of this work is to consider model bounds with respect to more expressive formula languages such as decision diagrams.…”
Section: Discussionmentioning
confidence: 99%
“…Limitations (1) and (2) have been shown to be a fundamental constraint for its applicability to ranking in probabilistic databases [43]. In order to decide the relative rank of two answer tuples, it suffices to approximate their probability intervals until they are disjoint; this requires the approximation to be both deterministic and incremental.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…We then propagate them to annotations of analysis results. In other contexts (various query languages) it was shown that it suffices to compute provenance and then evaluate it (specialize it) in specific meta-domains [23,17,4].…”
Section: Introductionmentioning
confidence: 99%
“…We further analyze the complexity of computing such expressions as well as their possible sizes. Then we consider the commutation with homomorphism property, which was proven for various database query languages in [23,17,4]. The property is essential for the soundness of applying provenance to provisioning and to specialization in metadomains of interest.…”
Section: Introductionmentioning
confidence: 99%