Proceedings of the Twenty-Seventh ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems 2008
DOI: 10.1145/1376916.1376932
|View full text |Cite
|
Sign up to set email alerts
|

Approximating predicates and expressive queries on probabilistic databases

Abstract: We study complexity and approximation of queries in an expressive query language for probabilistic databases. The language studied supports the compositional use of confidence computation. It allows for a wide range of new use cases, such as the computation of conditional probabilities and of selections based on predicates that involve marginal and conditional probabilities. These features have important applications in areas such as data cleaning and the processing of sensor data. We establish techniques for … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

1
56
0

Year Published

2009
2009
2013
2013

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 31 publications
(57 citation statements)
references
References 21 publications
1
56
0
Order By: Relevance
“…It was first shown by Karp, Luby, and Madras [33] that there is a fully polynomial-time randomised approximation scheme (FPTRAS) for DNF counting based on Monte Carlo simulation. This algorithm can be modified to compute the probability of a DNF over independent discrete random variables [12,25,34,46]. These techniques yield an efficiently computable unbiased estimator that in expectation returns the probability p of a DNF of n clauses such that computing the average of a polynomial number of such Monte Carlo steps (which are calls to the Karp-Luby unbiased estimator) is an ( , δ)-approximation for the probability (i.e., a relative approximation): If the averagep is taken over at least 3 · n · log(2/δ)/ 2 Monte Carlo steps, then Pr | p −p| ≥ · p ≤ δ.…”
Section: Related Workmentioning
confidence: 99%
“…It was first shown by Karp, Luby, and Madras [33] that there is a fully polynomial-time randomised approximation scheme (FPTRAS) for DNF counting based on Monte Carlo simulation. This algorithm can be modified to compute the probability of a DNF over independent discrete random variables [12,25,34,46]. These techniques yield an efficiently computable unbiased estimator that in expectation returns the probability p of a DNF of n clauses such that computing the average of a polynomial number of such Monte Carlo steps (which are calls to the Karp-Luby unbiased estimator) is an ( , δ)-approximation for the probability (i.e., a relative approximation): If the averagep is taken over at least 3 · n · log(2/δ)/ 2 Monte Carlo steps, then Pr | p −p| ≥ · p ≤ δ.…”
Section: Related Workmentioning
confidence: 99%
“…These systems model data with relations and therefore, they cannot perform shortest path computations on graphs efficiently. Also, since computing exact answers to many typical SQL queries has been shown to have #P-complete data complexity [13], research has focused on computing approximate answers [25,34].…”
Section: Related Workmentioning
confidence: 99%
“…Koch [30] formalizes a language that allows predication on probabilities and discusses approximation algorithms for this richer language, though he does not consider HAVING aggregation. This is in part due to the fact that his aim is to create a fully compositional language for probabilistic databases [31].…”
Section: Related Workmentioning
confidence: 99%