2007
DOI: 10.1002/asi.20694

Metadata harvesting for content‐based distributed information retrieval

Abstract: We propose an approach to content-based Distributed Information Retrieval based on the periodic and incremental centralization of full-content indices of widely dispersed and autonomously managed document sources. Inspired by the success of the Open Archives Initiative's (OAI) Protocol for Metadata Harvesting, the approach occupies middle ground between content crawling and distributed retrieval. As in crawling, some data move toward the retrieval process, but it is statistics about the content rather than cont…

Cited by 7 publications (6 citation statements)
References 41 publications
“…In addition to the practical implications of this work, the findings on the characteristics of domain-specific repositories, improvements in query-performance prediction, and application of query-performance prediction for improving retrieval performance in distributed repositories can have implications for several IR-research areas including the design of domain-specific repositories (Bhavnani et al, 2006; Finn & Kushmerick, 2006; Marcial & Hemminger, 2010; Muresan & Klavans, 2013; Pattuelli, 2011; Tang, Yang, & Song, 2013), distributed repositories and federated search (Avrahami et al, 2006; Davis & Lagoze, 2000; Paltoglou et al, 2011; Simeoni et al, 2008), query expansion (Alemayehu, 2003; Efthimiadis, 2000; Minker et al, 1973; Shiri & Revie, 2006), query analysis and characteristics (Bashir & Rauber, 2011), and proactive document-recommendation systems (Liu et al, 2012).…”
Section: Discussion (mentioning)
confidence: 97%
“…The improvements in query performance prediction can lead to several new applications for improving IR including selective query expansion (Alemayehu, 2003; Efthimiadis, 2000; Minker, Peltola, & Wilson, 1973; Shiri & Revie, 2006), federated search and query routing (Avrahami, Yau, Si, & Callan, 2006; Davis & Lagoze, 2000; Paltoglou, Salampasis, & Satratzemi, 2011; Simeoni, Yakici, Neely, & Crestani, 2008) and query-based proactive document recommendation systems (Liu, Lai, & Chen, 2012). For example, users can be recommended query-expansion terms for improved retrieval based on the predicted performance of alternative expanded queries.…”
Section: Discussion (mentioning)
confidence: 99%
“…Works about the harvesting process itself have tended either to be quite technical (Simeoni, Yakici, Neely, & Crestani, 2008) or to offer a general view of how a metadata harvesting process works in relation to issues of metadata quality and analysis of Dublin Core usage (Elings & Waibel, 2007;Graham, 2001;Hagedorn, 2003;Ward, 2004). Cole and Foulonneau's Using the Open Archives Initiative for Metadata Harvesting (2007) covers both the technical operations of OAI-PMH and how it works on the ground for those out there creating and managing metadata in their local environments.…”
Section: The Harvesting Process (mentioning)
confidence: 98%
“…Medeiros (2006), Shreeves, Riley, and Milewicz (2006), and Elings and Waibel (2007) all described the importance of metadata sharing and discussed issues with metadata aggregation. Simeoni, Yakici, Neely, and Crestani (2008) focused on a content-based distributed information retrieval approach which, they stated, "occupies middle ground between content crawling and distributed retrieval" (p. 12).…”
Section: Metadata Sharing (mentioning)
confidence: 99%