2009
DOI: 10.1007/s10619-009-7050-y
|View full text |Cite
|
Sign up to set email alerts
|

Semantics and evaluation of top-k queries in probabilistic databases

Abstract: We formulate three intuitive semantic properties for topk queries in probabilistic databases, and propose GlobalTopk query semantics which satisfies all of them. We provide a dynamic programming algorithm to evaluate top-k queries under Global-Topk semantics in simple probabilistic relations. For general probabilistic relations, we show a polynomial reduction to the simple case. Our analysis shows that the complexity of query evaluation is linear in k and at most quadratic in database size.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
71
0

Year Published

2009
2009
2023
2023

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 66 publications
(71 citation statements)
references
References 23 publications
0
71
0
Order By: Relevance
“…Contrasting the properties of different semantics of ω * , and studying their potential applications have been addressed in recent works [7,22,24,26]. Our focus in this paper, however, is building an infrastructure that incrementally computes both J k and ω * under multiple semantics in the context of URANKJOIN queries.…”
Section: Problem Definitionmentioning
confidence: 99%
“…Contrasting the properties of different semantics of ω * , and studying their potential applications have been addressed in recent works [7,22,24,26]. Our focus in this paper, however, is building an infrastructure that incrementally computes both J k and ω * under multiple semantics in the context of URANKJOIN queries.…”
Section: Problem Definitionmentioning
confidence: 99%
“…Zhang and Chomicki developed the global top-k semantics on uncertain data which returns k tuples having the largest probability in the top-k list, and gave a dynamic programming algorithm [56].…”
Section: Top-k Queries On Uncertain Datamentioning
confidence: 99%
“…Recently, there have been a few interesting studies on ranking queries on static uncertain data using the possible world semantics [30,31,40,49,54,56]. Among them, probabilistic threshold top-k queries [30,31] compute uncertain records taking a probability of at least p to be in the top-k list where p is a user specified probability threshold.…”
Section: Introductionmentioning
confidence: 99%
“…Also the Expected rank query satisfy all these properties [6], the expected value always leads to information loss. The Global-topk [8] query returns k highest-ranked tuples according to their probability of being in the top-k answers in possible worlds. It satisfies all properties except 'Containment'.…”
Section: Introductionmentioning
confidence: 99%