Federated search, also called distributed information retrieval, routes a user's query to multiple component collections and presents a single merged result list, ranked by comparing the relevance score of each returned result. The heterogeneity of the component collections, however, makes it difficult for the central broker to compare these relevance scores when fusing the results into one ranked list. Most existing approaches handle this either by converting document ranks into ranking scores or by downloading the documents and computing their relevance scores. Both have drawbacks: the former merges results poorly because the collections return a negligible number of overlapping documents, and the latter is resource intensive. This paper addresses the problem by proposing a new method that extracts features of both documents and component collections from the information the collections provide at query time. The features of each document and of its collection are used together to establish the document's relevance score. Ant colony optimization is then used to create the merged result list. Experimental results on the TREC 2013 FedWeb dataset demonstrate that the proposed method significantly outperforms the baseline approaches.
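The general idea of scoring each result from document-level and collection-level features available at query time can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `Result` fields, the three document features, the single collection feature, and the hand-picked weights are hypothetical and are not taken from the paper. The sketch also does not implement ant colony optimization; a fixed linear weighting stands in for whatever scoring the paper's method produces.

```python
from dataclasses import dataclass


@dataclass
class Result:
    """One result returned by a component collection (hypothetical fields)."""
    doc_id: str
    collection: str
    rank: int      # 1-based rank within its source collection
    title: str
    snippet: str


def term_overlap(query: str, text: str) -> float:
    """Fraction of distinct query terms that also occur in the text."""
    q_terms = set(query.lower().split())
    if not q_terms:
        return 0.0
    return len(q_terms & set(text.lower().split())) / len(q_terms)


def document_features(query: str, r: Result) -> list[float]:
    """Illustrative document features: reciprocal rank, title and snippet overlap."""
    return [1.0 / r.rank, term_overlap(query, r.title), term_overlap(query, r.snippet)]


def collection_features(query: str, results: list[Result]) -> list[float]:
    """Illustrative collection feature: mean snippet overlap of its returned results."""
    if not results:
        return [0.0]
    return [sum(term_overlap(query, r.snippet) for r in results) / len(results)]


def merge(query: str, result_lists: dict[str, list[Result]],
          weights: list[float]) -> list[tuple[Result, float]]:
    """Score every result from its document and collection features, then sort."""
    scored = []
    for results in result_lists.values():
        c_feats = collection_features(query, results)
        for r in results:
            feats = document_features(query, r) + c_feats
            score = sum(w * f for w, f in zip(weights, feats))
            scored.append((r, score))
    # Highest combined score first: this is the merged result list.
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    lists = {
        "A": [Result("a1", "A", 1, "Federated search basics", "An overview of federated search"),
              Result("a2", "A", 2, "Cooking pasta", "Boil water and add salt")],
        "B": [Result("b1", "B", 1, "Gardening tips", "How to grow tomatoes")],
    }
    # Placeholder weights: one per document feature plus one collection feature.
    for result, score in merge("federated search", lists, [0.1, 0.3, 0.3, 0.3]):
        print(result.doc_id, round(score, 3))
```

Because the score depends only on ranks, titles, and snippets, the broker in this sketch never downloads full documents, and the collection-level feature lets a result from a generally on-topic collection outrank an equally ranked result from an off-topic one.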