Most operational information retrieval (IR) systems in existence today use Boolean logic during search; such systems are usually called Boolean IR systems. Like any other IR system, they are not perfect, and the problem of developing them further (and hence providing better-quality service for real users) is one of the most important problems in Information Science. From this viewpoint, the article analyzes existing criticisms of operational systems and points out some of their positive features. It also considers certain negative effects that hinder the development of existing systems. Finally, the article offers several conclusions about using Boolean logic in developing multiversion IR systems.
Combining multiple sources of evidence (multiple query formulations, multiple retrieval schemes or systems) has been shown, mostly experimentally, to be effective for data fusion in information retrieval. However, the question of why and how combination should be done remains largely unanswered. In this paper, we provide a model for simulation and a framework for analysis in the study of data fusion in the information retrieval domain. A rank/score function is defined, and the concept of a Cayley graph is used in the design and analysis of our framework. The model and framework have led us to a better understanding of the data fusion phenomena in information retrieval. In particular, by exploiting the graphical properties of the rank/score function, we have shown analytically and by simulation that combination using rank performs better than combination using score under certain conditions. Moreover, we demonstrate that the rank/score function might be used as a predictive variable for the effectiveness of combining multiple sources of evidence.
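To make the distinction between combination by score and combination by rank concrete, the following is a minimal sketch rather than the paper's model. It assumes, since the abstract does not spell this out, that the rank/score function maps each rank position to the score of the document at that position, that both systems score the same document set on comparable scales, and that combination means simple averaging. All function names and the toy scores are hypothetical.

```python
from typing import Dict, List

Scores = Dict[str, float]  # document id -> score assigned by one system


def ranks_from_scores(scores: Scores) -> Dict[str, int]:
    """Rank 1 is the highest score; ties are broken by document id."""
    ordered = sorted(scores, key=lambda d: (-scores[d], d))
    return {doc: i + 1 for i, doc in enumerate(ordered)}


def rank_score_function(scores: Scores) -> List[float]:
    """Assumed definition: the score found at each rank position, rank 1 first."""
    return sorted(scores.values(), reverse=True)


def combine_by_score(a: Scores, b: Scores) -> List[str]:
    """Order documents by the average of their two scores (higher is better)."""
    avg = {d: (a[d] + b[d]) / 2 for d in a}
    return sorted(avg, key=lambda d: (-avg[d], d))


def combine_by_rank(a: Scores, b: Scores) -> List[str]:
    """Order documents by the average of their two ranks (lower is better)."""
    ra, rb = ranks_from_scores(a), ranks_from_scores(b)
    avg = {d: (ra[d] + rb[d]) / 2 for d in a}
    return sorted(avg, key=lambda d: (avg[d], d))


# Toy data (invented for illustration): system A has a steep rank/score
# curve, system B a flat one, so the two combinations disagree at the top.
system_a = {"d1": 10.0, "d2": 4.0, "d3": 3.0, "d4": 2.0}
system_b = {"d1": 8.0, "d2": 10.0, "d3": 1.0, "d4": 9.0}

print(rank_score_function(system_a))         # [10.0, 4.0, 3.0, 2.0]
print(combine_by_score(system_a, system_b))  # ['d1', 'd2', 'd4', 'd3']
print(combine_by_rank(system_a, system_b))   # ['d2', 'd1', 'd4', 'd3']
```

In this toy case the large score gap at the top of system A dominates the score average, whereas rank averaging ignores the size of the gaps. That is one way the shape of the rank/score function can affect which combination behaves differently; whether either ordering is better depends on relevance judgments, which this sketch does not model.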