2012
DOI: 10.1007/978-3-642-35063-4_1
|View full text |Cite
|
Sign up to set email alerts
|

Improving the Performance of Pipelined Query Processing with Skipping

Abstract: Abstract. Web search engines need to provide high throughput and short query latency. Recent results show that pipelined query processing over a term-wise partitioned inverted index may have superior throughput. However, the query processing latency and scalability with respect to the collections size are the main challenges associated with this method. In this paper, we evaluate the effect of inverted index skipping on the performance of pipelined query processing. Further, we introduce a novel idea of using … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2012
2012
2018
2018

Publication Types

Select...
5

Relationship

3
2

Authors

Journals

citations
Cited by 6 publications
(6 citation statements)
references
References 12 publications
0
6
0
Order By: Relevance
“…The access cost is negligible compared to the cost of posting list processing. The broker node stores a memory-based array of maximum scores which contributes with another 200MB of data, but according to our observations in an earlier work [Jonassen and Bratsberg 2012a], this significantly improves the performance.…”
Section: Methodsmentioning
confidence: 85%
See 3 more Smart Citations
“…The access cost is negligible compared to the cost of posting list processing. The broker node stores a memory-based array of maximum scores which contributes with another 200MB of data, but according to our observations in an earlier work [Jonassen and Bratsberg 2012a], this significantly improves the performance.…”
Section: Methodsmentioning
confidence: 85%
“…The query processing framework is implemented in Java. 4 In the implementation, we use the Okapi BM25 scoring model with skipping and MaxScore optimizations [Jonassen and Bratsberg 2012a].…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…Documents can be retrieved and ranked by matching the query vector versus the document vector to compute the score or similarity. The retrieved documents are ranked according to the similarity to the user query [33][34][35][36].…”
Section: Matching and Rankingmentioning
confidence: 99%