In this paper, we propose a contextual retrieval framework which incorporates the user and global contextual information into the probabilistic retrieval model. We investigate different techniques of using contextual information to improve information retrieval performance in details. In particular, (1) we use the related text contextual information for query expansion;(2) we use the granularity information to construct the document level index and paragraph level index; (3) we use the geographic information for filtering. In addition, a new term weighting function BM50 is proposed based on the global context information. This framework is adaptable and extensible. If there is a new context category, we can extend the existing search system to accommodate it. Finally, we report our experimental findings on TREC data sets.
In this paper, we propose a dual index model for contextual IR. For each query, we search against both document level and passage level indexes, and use the corresponding merge function to update the weights for both documents and paragraphs by combining the results from both indexes according to the granularity information in metadata. Experiments on 2004 TREC data show that a significant improvement can be made by using the dual index model.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.