Proceedings. 18th International Conference on Distributed Computing Systems (Cat. No.98CB36183)
DOI: 10.1109/icdcs.1998.679488
|View full text |Cite
|
Sign up to set email alerts
|

Methodologies for distributed information retrieval

Abstract: Text collections have traditionally been located at a single site and managed as a monolithic whole. However, it is now common for a collection to be spread over several hosts and for these hosts to be geographically separated. In this paper we examine several alternative approaches to distributed text retrieval. We report on our experience with a full implementation of these methods, and give retrieval efficiency and retrieval effectiveness results for collections distributed over both a local area network an… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
15
0

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 29 publications
(15 citation statements)
references
References 20 publications
0
15
0
Order By: Relevance
“…In principle, particularly in the absence of cooperation, it would be possible for a collection to contain many relevant documents, be highly ranked, but for the collection's ranking mechanism to be unable to find the documents. In practice, in cooperative systems in which information such as term statistics are shared, this is no more likely than in a monolithic system (de Kretser et al, 1998). Thus it is reasonable to compare systems by their ability to find collections with relevant documents, while noting that there remains the issue of how to find documents within collections and combine these results.…”
Section: Introductionmentioning
confidence: 99%
“…In principle, particularly in the absence of cooperation, it would be possible for a collection to contain many relevant documents, be highly ranked, but for the collection's ranking mechanism to be unable to find the documents. In practice, in cooperative systems in which information such as term statistics are shared, this is no more likely than in a monolithic system (de Kretser et al, 1998). Thus it is reasonable to compare systems by their ability to find collections with relevant documents, while noting that there remains the issue of how to find documents within collections and combine these results.…”
Section: Introductionmentioning
confidence: 99%
“…A straightforward way of distributing the retrieval task is to allocate each computer, or server, a defined fraction of the documents and then build an index for each local document set (Harman et al, 1991;de Kretser et al, 1998;Cahoon et al, 2000). Each index consists of a complete vocabulary for the documents on that computer and, for each term in the vocabulary, an inverted list recording the documents containing the term and (if phrase querying is to be supported) the positions in each document at which the term occurs.…”
Section: Document-partitioned Indexing and Queryingmentioning
confidence: 99%
“…Tmap1 estimates the agents' routing times by adding the costs of the next adjacent set of nodes in the sorted list. Exceeding the upper bound of the time constraint (the threshold ''T end ''), i.e., the impossibility of performing the task at the node within the time window, is not allowed when assigning an adjacent node to the tour (lines 22,23,26). In this manner, the path of the agents can be decided using the sorted list.…”
Section: Planning Algorithmsmentioning
confidence: 99%
“…In this application, information is spread over several nodes, which are commonly geographically separated [22]. Mobile agents migrate to the nodes where the data are located to perform their retrieval tasks there instead of transmitting data across the network.…”
Section: Introductionmentioning
confidence: 99%