2013
DOI: 10.1007/978-3-319-02432-5_9
|View full text |Cite
|
Sign up to set email alerts
|

Query Processing in Highly-Loaded Search Engines

Abstract: Abstract. While Web search engines are built to cope with a large number of queries, query traffic can exceed the maximum query rate supported by the underlying computing infrastructure. We study how response times and results vary when, in presence of high loads, some queries are either interrupted after a fixed time threshold elapses or dropped completely. Moreover, we introduce a novel dropping strategy, based on machine learned performance predictors to select the queries to drop in order to sustain the la… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2013
2013
2019
2019

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 7 publications
0
3
0
Order By: Relevance
“…A variety of work has explored the efficiency of sharded search systems, covering topics including: reducing the communications and merging costs when large numbers of shards are searched [13]; load balancing in mirrored systems [23,38]; query shedding under high load to improve overall throughput [10]; and query pruning to improve efficiency [59]. Other work focuses on addressing the load imbal-ances that arise when non-random shards are used, including the development of techniques for strategic assignment of index postings to shards, and strategic replication of frequently-accessed elements [41,42].…”
Section: Distributed Retrievalmentioning
confidence: 99%
“…A variety of work has explored the efficiency of sharded search systems, covering topics including: reducing the communications and merging costs when large numbers of shards are searched [13]; load balancing in mirrored systems [23,38]; query shedding under high load to improve overall throughput [10]; and query pruning to improve efficiency [59]. Other work focuses on addressing the load imbal-ances that arise when non-random shards are used, including the development of techniques for strategic assignment of index postings to shards, and strategic replication of frequently-accessed elements [41,42].…”
Section: Distributed Retrievalmentioning
confidence: 99%
“…For example, consideration of the spatial and temporal variance in energy prices that RESQ exploits may lead to increased cost savings for cache eviction algorithms (e.g., [16] and [7]). Similarly, adaptive selection of underlying site retrieval strategies (e.g., based on query efficiency prediction [5]) may also help RESQ process the queued query volume at a local site within particularly low-or high-price time durations. While the design of RESQ does not necessarily preclude the adoption of such existing approaches, a fresh look at the problems in the context of rank-and energyawareness for distributed search systems may yield useful insights.…”
Section: Resultsmentioning
confidence: 99%
“…Daniele et al [15] use query efficiency predictors to feed a load-sensitive selective pruning framework and they also demonstrate that a mutiple feature predictor using DAAT is more accurate than a single feature one. In [16], authors use predictors to introduce a novel dropping strategy for maintaining the response times under a specified threshold.…”
Section: Query Efficiency Predictorsmentioning
confidence: 99%