Proceedings of the 29th ACM International Conference on Information & Knowledge Management 2020
DOI: 10.1145/3340531.3412080

A Comparison of Top-k Threshold Estimation Techniques for Disjunctive Query Processing

citations
Cited by 10 publications
(3 citation statements)
references
References 20 publications
“…There are also related strategies to obtain an accurate top-𝑘 threshold earlier, e.g. [31,33,35,37,42]. While we can benefit from these studies, this paper does not study them because they represent orthogonal optimizations.…”
Section: Background and Related Work (mentioning)
confidence: 99%
“…For example, de Carvalho et al. (2015) store the values of the kth highest scores in each postings list for certain values of k. Initializing to the largest (across the terms) of the kth largest (across documents) contributions C(t, d) is then safe, because for the given query there must be at least k similarity scores greater than or equal to that value. Kane and Tompa (2018), Yafay and Altingovde (2019), and Petri et al. (2019) have explored similar options, and Mallia et al. (2020) compare a range of such initializations.…”
Section: Introduction (mentioning)
confidence: 99%
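
The initialization described in the statement above can be illustrated with a minimal sketch. It assumes a disjunctive query whose document score is a sum of non-negative per-term contributions C(t, d), so that any document's score is at least its contribution from a single term. The function names `kth_largest_scores` and `initial_threshold`, the dictionary layout, and the toy numbers are hypothetical and are not taken from any of the cited papers. When the requested k was not stored, the sketch falls back to the smallest stored k' ≥ k, which yields a lower but still safe value.

```python
import heapq

def kth_largest_scores(contributions, ks):
    """For one postings list, map each k in ks to the k-th largest contribution.

    Values of k larger than the list length are skipped (nothing to store).
    """
    table = {}
    for k in ks:
        if len(contributions) >= k:
            table[k] = heapq.nlargest(k, contributions)[-1]
    return table

def initial_threshold(query_terms, stored, k):
    """Return a safe initial top-k threshold for a disjunctive query.

    Assumption: contributions are non-negative and scores are additive, so
    the k-th largest contribution of any single query term is a lower bound
    on the query's final k-th largest document score.
    """
    best = 0.0  # 0.0 means "no useful estimate"; always safe under the assumption
    for term in query_terms:
        table = stored.get(term, {})
        usable = [kk for kk in table if kk >= k]  # stored ranks that stay safe for k
        if usable:
            best = max(best, table[min(usable)])
    return best

# Hypothetical toy index: per-term contributions C(t, d) over the postings list.
postings = {
    "a": [3.0, 2.5, 1.0, 0.5],
    "b": [4.0, 0.2, 0.1],
}
stored = {t: kth_largest_scores(c, ks=(1, 2, 3)) for t, c in postings.items()}

threshold_k2 = initial_threshold(["a", "b"], stored, k=2)
threshold_k4 = initial_threshold(["a", "b"], stored, k=4)
```

Here `threshold_k2` takes the larger of the two terms' second-highest contributions, while `threshold_k4` falls back to 0.0 because no rank of 4 or more was stored for either term.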
“…A second strand of development has pursued non-safe approaches, including the quit/continue heuristics [51]; and approximations that deliberately over-estimate the current heap entry threshold (when maintaining the top-𝑘 set) [10,14,16,42,64]. In related work, researchers have sought bounds on the query's final 𝑘th largest document score [19,29,73], or to provide a conservative estimate of it [49,57], seeking to bypass fruitless early work when dynamic pruning mechanisms are in play.…”
Section: Introduction (mentioning)
confidence: 99%