2004
DOI: 10.1109/tpds.2004.1264782

Information retrieval with distributed databases: analytic models of performance

Abstract: The major emphasis of this paper is on analytical techniques for predicting the performance of various collection fusion scenarios. Knowledge of analytical models of information retrieval system performance, both with single processors and with multiple processors, increases our understanding of the parameters (e.g., number of documents, ranking algorithms, stemming algorithms, stop word lists, etc.) affecting system behavior. While there is a growing literature on the implementation of distributed in…

Cited by 14 publications (7 citation statements)
References 40 publications
“…When the data and DBMS software are distributed over several sites, one site may fail while other sites continue to operate. This improves both reliability and availability (Losee & Church, 2004).…”
Section: Introduction (mentioning)
confidence: 99%
“…The next decade of research in the database era will focus on delivering widely available access to unprecedented amounts of constantly expanding data that is distributed all over the net (Losee & Church, 2004; Abiteboul et al., 2003; Ohta & Ishikawa, 2003). Users will benefit from new machine learning technologies that mine new knowledge by integrating and analyzing huge amounts of widely distributed data to uncover and report upon subtle relationships and patterns of events that are not immediately recognized by direct human inspection (Aljanaby et al., 2005; Kossman, 2000).…”
Section: Introduction (mentioning)
confidence: 99%
“…Other performance measures could be used for such a study, or the relationship between performance with this measure and performance with other measures might be studied (Losee, 2000). An earlier study on distributed information retrieval provides methods using Average Search Length as a performance measure that can address issues such as the clustering problems discussed above (Losee & Church, 2004). Using these techniques, we may generalize the study of the Clustering Performance Question to any number of clusters.…”
Section: Discussion (mentioning)
confidence: 99%
“…. n. In the fetching process [28], the system will fetch the information from the RBS and FBS, and we can set the range of the information from {0,1}. To fetch the information from database, we are considering a unit scale with datasets (A), the A of zero representing the average position of relevant documents being available inside the database at the beginning of the search process, and an A of one being at the end of the search process.…”
Section: Model Formulation (mentioning)
confidence: 99%
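
Two of the statements above refer to Average Search Length as a performance measure and to a unit-scale parameter A for the average position of relevant documents. The following is a minimal illustrative sketch, not the cited paper's analytic model: it assumes Average Search Length can be estimated empirically as the mean 1-based rank of the relevant documents in a single ranked list, and it assumes a simple linear rescaling of that mean rank onto [0, 1]. The function names `average_search_length` and `normalized_position` are hypothetical.

```python
from typing import Sequence


def average_search_length(relevance: Sequence[bool]) -> float:
    """Mean 1-based rank of the relevant documents in a ranked list.

    Assumption: this empirical mean stands in for Average Search Length;
    the cited paper derives the measure analytically instead.
    """
    ranks = [rank for rank, is_relevant in enumerate(relevance, start=1) if is_relevant]
    if not ranks:
        raise ValueError("ranking contains no relevant documents")
    return sum(ranks) / len(ranks)


def normalized_position(relevance: Sequence[bool]) -> float:
    """Rescale the mean rank of relevant documents onto a unit scale.

    Assumption: a linear map where 0.0 means the relevant documents sit at
    the start of the ranking and 1.0 means they sit at the end.
    """
    n = len(relevance)
    if n < 2:
        raise ValueError("need at least two documents to normalize")
    return (average_search_length(relevance) - 1) / (n - 1)


if __name__ == "__main__":
    ranking = [True, False, True, False, False]  # relevant at ranks 1 and 3
    print(average_search_length(ranking))  # 2.0
    print(normalized_position(ranking))    # 0.25
```

Under these assumptions, a lower value on either measure indicates that relevant documents are reached earlier in the search.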