Proceedings of the Twelfth International Conference on Information and Knowledge Management 2003
DOI: 10.1145/956863.956944
|View full text |Cite
|
Sign up to set email alerts
|

Efficient query evaluation using a two-level retrieval process

Abstract: We present an efficient query evaluation method based on a two level approach: at the first level, our method iterates in parallel over query term postings and identifies candidate documents using an approximate evaluation taking into account only partial information on term occurrences and no query independent factors; at the second level, promising candidates are fully evaluated and their exact scores are computed. The efficiency of the evaluation process can be improved significantly using dynamic pruning t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

4
324
0

Year Published

2006
2006
2019
2019

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 319 publications
(330 citation statements)
references
References 19 publications
4
324
0
Order By: Relevance
“…In order to get the successor of 49, firstly we find the node which has the longest common prefix of 00110001. We run binary search over the 8 hash tables and get 001 in Table [6]. After that, we can find the successor via its right pointer which pointed to the successor node at the leaf level.…”
Section: Data Structurementioning
confidence: 99%
“…In order to get the successor of 49, firstly we find the node which has the longest common prefix of 00110001. We run binary search over the 8 hash tables and get 001 in Table [6]. After that, we can find the successor via its right pointer which pointed to the successor node at the leaf level.…”
Section: Data Structurementioning
confidence: 99%
“…However, this also means that an entire block must be decompressed even when just a single posting is required from it (e.g. for partial scoring approaches such as WAND [15]). Moreover, it is possible to obtain a larger output than the input when there are not enough integers to compress, because extra space is required in the output to store information needed at decompression time.…”
Section: List-adaptive Codecsmentioning
confidence: 99%
“…For instance, at the matching layer, dynamic pruning techniques such as WAND [15] enhance efficiency by omitting the scoring of documents that cannot reach the final retrieved set. In the top-most re-ranking layer, Cambazoglu et al [16] showed how learning to rank models could be simplified to enhance their efficiency.…”
Section: Introductionmentioning
confidence: 99%
“…Despite various attempts to displace inverted indexes from their dominant position for document ranking tasks over the years, no alternative has been able to consistently produce the same level of efficiency, effectiveness, and time / space trade-offs that inverted indexes can provide (see, for instance Zobel et al [45]). Ranked document retrieval requires that only the top-k documents are returned, and, as a result, researchers have proposed many heuristic approaches to improve top-k efficiency [1,4,5,6,32,38]. These approaches can be classified in two general categories: term-at-a-time (TAAT) and document-at-a-time (DAAT).…”
Section: Inverted Indexesmentioning
confidence: 99%
“…As the minimum bounding score in the heap slowly increases, more and more postings can be omitted. Enhanced DAAT pruning strategies similar in spirit to MAXSCORE have been shown to further increase efficiency [4,38]. Turtle and Flood also describe a similar approach to improve the efficiency of TAAT strategies.…”
Section: Document-at-a-time Processing (Daat)mentioning
confidence: 99%