Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval 2006
DOI: 10.1145/1148170.1148233
|View full text |Cite
|
Sign up to set email alerts
|

Hybrid index maintenance for growing text collections

Abstract: We present a new family of hybrid index maintenance strategies to be used in on-line index construction for monotonically growing text collections. These new strategies improve upon recent results for hybrid index maintenance in dynamic text retrieval systems. Like previous techniques, our new method distinguishes between short and long posting lists: While short lists are maintained using a merge strategy, long lists are kept separate and are updated in-place. This way, costly relocations of long posting list… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
34
0

Year Published

2011
2011
2016
2016

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 31 publications
(34 citation statements)
references
References 15 publications
0
34
0
Order By: Relevance
“…An updatable keyword search system is usually implemented with a hierarchy of indexes [24,14,21,26]. New data is accumulated in a small updatable structure that also supports concurrent queries, while the main part of the hierarchy consists of a set of read-only indexes.…”
Section: System Modelmentioning
confidence: 99%
See 3 more Smart Citations
“…An updatable keyword search system is usually implemented with a hierarchy of indexes [24,14,21,26]. New data is accumulated in a small updatable structure that also supports concurrent queries, while the main part of the hierarchy consists of a set of read-only indexes.…”
Section: System Modelmentioning
confidence: 99%
“…We use cost models to explore this large space. In information retrieval, cost models have traditionally been used to explain and compare the relative merits of different algorithms [35,34,14,16]. In this paper, however, the cost models are used in optimization algorithms to select between different access designs for a stratified workload of updates and queries.…”
Section: Cost Models and Optimizationmentioning
confidence: 99%
See 2 more Smart Citations
“…More recently, Luk and Lam [23] describe an internal-memory inverted file implementation based on hash tables with chaining, but their method also does not support fast item removals. Lester et al [21,22] and Büttcher et al [10] describe externalmemory inverted file implementations that support item insertions only. Büttcher and Clarke [9] consider trade-offs for allowing for both item insertions and removals, and Guo et al [17] give a solution for performing such operations by using a B-tree variant.…”
Section: Introductionmentioning
confidence: 99%