2014
DOI: 10.1145/2661629

Cache Design of SSD-Based Search Engine Architectures

Abstract: Caching is an important optimization in search engine architectures. Existing caching techniques for search engine optimization are mostly biased towards the reduction of random accesses to disks, because random accesses are known to be much more expensive than sequential accesses in traditional magnetic hard disk drive (HDD). Recently, solid-state drive (SSD) has emerged as a new kind of secondary storage medium, and some search engines like Baidu have already used SSD to completely replace HDD in their infra…

Cited by 18 publications (8 citation statements)
References 50 publications
“…In-storage NDP. In-storage NDP has been widely studied in various use-cases, such as databases [17,19,31,37], map-reduce [23,24,29], regex [8,28], searching [19,35,36], etc. A common theme of such works is to explore the idea of pushing computation to the storage devices for improved performance.…”
Section: Background and Related Work (mentioning)
confidence: 99%
“…List caching involves storing commonly accessed postings lists in fast-access memory [18,72]. For example, if the postings lists making up the index are stored on an SSD, a list caching strategy may opt to keep the n most accessed postings in main-memory.…”
Section: Caching For Large-scale Search (mentioning)
confidence: 99%
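
The list caching idea in the statement above can be illustrated with a minimal sketch. This is not the method of the cited paper or of the citing one. The class name `TopNListCache`, the `load_from_ssd` callable, and the frequency-based admission rule are all assumptions made for illustration.

```python
from collections import Counter


class TopNListCache:
    """Keeps roughly the n most frequently accessed postings lists in memory.

    Hypothetical sketch: `load_from_ssd` stands in for whatever reads a
    postings list from secondary storage (assumed: term -> list of doc ids).
    """

    def __init__(self, n, load_from_ssd):
        self.n = n
        self.load_from_ssd = load_from_ssd
        self.counts = Counter()   # access count per term
        self.cache = {}           # term -> postings list held in memory
        self.ssd_reads = 0        # number of reads that went to storage

    def get(self, term):
        self.counts[term] += 1
        if term in self.cache:
            return self.cache[term]
        postings = self.load_from_ssd(term)
        self.ssd_reads += 1
        self._admit(term, postings)
        return postings

    def _admit(self, term, postings):
        # Fill the cache while there is room.
        if len(self.cache) < self.n:
            self.cache[term] = postings
            return
        # Otherwise replace the least-accessed cached term, but only if the
        # new term has strictly more accesses (assumed admission rule).
        victim = min(self.cache, key=lambda t: self.counts[t])
        if self.counts[term] > self.counts[victim]:
            del self.cache[victim]
            self.cache[term] = postings
```

A caller would wrap its index reader, for example `TopNListCache(1000, reader.read_postings)` with a hypothetical `reader`, and call `get(term)` per query term. Counting accesses is only one way to define "most accessed"; a production cache would also need to account for list sizes.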
“…The performance of all of the approaches degrades as ϵ increases, and they are vulnerable to inaccuracies if the clustering error might be high. The key point is that if a search engine receives an exact match well-known query, it can simply return a cached result, which is what is commonly done in practice [3,18,32,36,50,72]; what we are exploring here is the consequence of also allowing approximated matches to be exploited.…”
Section: Online Re-ranking Using Real Queries (mentioning)
confidence: 99%
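
The exact-match result caching mentioned in the statement above might look like the following sketch. It is an illustration under stated assumptions, not the design of any cited system: `ResultCache` and `search_backend` are hypothetical names, the LRU eviction policy is an assumed choice, and the whitespace and case normalization of the key is an assumption about what counts as an "exact match".

```python
from collections import OrderedDict


class ResultCache:
    """Returns a stored result list when the same query is seen again."""

    def __init__(self, capacity, search_backend):
        self.capacity = capacity
        # Hypothetical callable: normalized query string -> result list.
        self.search_backend = search_backend
        self.entries = OrderedDict()  # normalized query -> results, LRU order
        self.hits = 0
        self.misses = 0

    @staticmethod
    def _key(query):
        # Assumed normalization: lowercase and collapse whitespace.
        return " ".join(query.lower().split())

    def search(self, query):
        key = self._key(query)
        if key in self.entries:
            self.entries.move_to_end(key)  # mark as most recently used
            self.hits += 1
            return self.entries[key]
        self.misses += 1
        results = self.search_backend(key)
        self.entries[key] = results
        if len(self.entries) > self.capacity:
            self.entries.popitem(last=False)  # evict least recently used
        return results
```

Only queries that map to the same key are served from the cache; the approximate matching that the citing work explores would need a different lookup step.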