Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval 2000
DOI: 10.1145/345508.345584
|View full text |Cite
|
Sign up to set email alerts
|

The impact of database selection on distributed searching

Abstract: The proliferation of online information resources increases the importance of effective and efficient distributed searching. Distributed searching is cast in three parts -database selection, query processing, and results merging. In this paper we examine the effect of database selection on retrieval performance. We look at retrieval performance in three different distributed retrieval testbeds and distill some general results. First we find that good database selection can result in better retrieval effectiven… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
78
1
1

Year Published

2002
2002
2012
2012

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 101 publications
(83 citation statements)
references
References 20 publications
1
78
1
1
Order By: Relevance
“…A lot of research in meta search and distributed retrieval investigates mapping user queries to a set of categories or collections (Dolin et al, 1998;Fuhr, 1999;Gauch et al, 1996;Gravano and Garcia-Molina, 1995;Howe and Dreilinger, 1997;Powell et al, 2003;Xu and Croft, 1999;Yu et al, 2001). However, all of the above techniques return the same results for a given query regardless of the submitted query.…”
Section: Related Workmentioning
confidence: 99%
“…A lot of research in meta search and distributed retrieval investigates mapping user queries to a set of categories or collections (Dolin et al, 1998;Fuhr, 1999;Gauch et al, 1996;Gravano and Garcia-Molina, 1995;Howe and Dreilinger, 1997;Powell et al, 2003;Xu and Croft, 1999;Yu et al, 2001). However, all of the above techniques return the same results for a given query regardless of the submitted query.…”
Section: Related Workmentioning
confidence: 99%
“…Since doc 11 and doc 12 in cluster c 1 together contribute to sim(A, B), sim(B, C), and sim(A, C), while in c 2 , only doc 21 contributes to the similarity of all the C 2 4 = 6 possible term pairs, we compute the cluster weights to be…”
Section: Server Rankingmentioning
confidence: 99%
“…Most of the existing server selection methods [11] keep statistical information about the participant search servers' indexes. They invariably maintain a list of terms found in the indexed documents, while the weight of a term may vary across different methods.…”
Section: Introductionmentioning
confidence: 99%
“…the document score of the retrieved record in the first position). As a variant of this normalized score merging scheme, Powell et al [28] suggest normalizing the document score rsv j according to the following formula: …”
Section: Merging Strategiesmentioning
confidence: 99%