[1993] Proceedings of the Second International Conference on Parallel and Distributed Information Systems
DOI: 10.1109/pdis.1993.253078
|View full text |Cite
|
Sign up to set email alerts
|

Performance of inverted indices in shared-nothing distributed text document information retrieval systems

Abstract: The performance of distributed text document retrieval systems is strongly in uenced b y t h e o r ganization of the inverted index. This paper compares the performance impact on query processing of various physical organizations for inverted lists. We present a new probabilistic model of the database and queries. Simulation experiments determine which variables most strongly inuence r esponse time and throughput. This lea d s t o a set of design trade-o s over a range of hardware c on gurations and new parall… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
83
0

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 67 publications
(83 citation statements)
references
References 12 publications
0
83
0
Order By: Relevance
“…The regression analysis performed confirms that the quadratic model fits better the real distribution (R = 0.99770) versus the linear model representing the Zipf's law (R = 0.98122). The quadratic model is similar to Zipf's, although in previous works [15], it has proved to match the actual distribution better. Given the quadratic fit curve, the form of the probability distribution Z 1 (w) is obtained from the quadratic model, divided by a normalisation constant [15].…”
Section: Document Modelmentioning
confidence: 98%
See 3 more Smart Citations
“…The regression analysis performed confirms that the quadratic model fits better the real distribution (R = 0.99770) versus the linear model representing the Zipf's law (R = 0.98122). The quadratic model is similar to Zipf's, although in previous works [15], it has proved to match the actual distribution better. Given the quadratic fit curve, the form of the probability distribution Z 1 (w) is obtained from the quadratic model, divided by a normalisation constant [15].…”
Section: Document Modelmentioning
confidence: 98%
“…The previous work for distributing the inverted index over a collection of servers is focused on the local and global inverted files strategies [13], [15], showing that the local inverted file is a more balanced strategy and a good query throughput could be achieved in most cases.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…The work presented in [26] compares the impact of performance for queries processing, using two different organizations for the invested lists. It proposes two basic options to classify the indexes: disk index and system index.…”
Section: Previous Work and Motivationmentioning
confidence: 99%