Literature in Molecular Biology is abundant with linguistic metaphors. There have been works in the past that attempt to draw parallels between linguistics and biology, driven by the fundamental premise that proteins have a language of their own. Since word detection is crucial to the decipherment of any unknown language, we attempt to establish a problem mapping from natural language text to protein sequences at the level of words. Towards this end, we explore the use of an unsupervised text segmentation algorithm to the task of extracting "biological words" from protein sequences. In particular, we demonstrate the effectiveness of using domain knowledge to complement data driven approaches in the text segmentation task, as well as in its biological counterpart. We also propose a novel extrinsic evaluation measure for protein words through protein family classification.
Case-Based Reasoning provides a framework for integrating domain knowledge with data in the form of four knowledge containers namely Case base, Vocabulary, Similarity and Adaptation. It is a known fact in Case-Based Reasoning community that knowledge can be interchanged between the containers. However, the explicit interplay between them, and how this interchange is affected by the knowledge richness of the underlying domain is not yet fully understood. We attempt to bridge this gap by proposing footprint size reduction as a measure for quantifying knowledge tradeoffs between containers. The proposed measure is empirically evaluated on synthetic as well as real world datasets. From a practical standpoint, footprint size reduction provides a unified way of estimating the impact of a given piece of knowledge in any knowledge container, and can also suggest ways of characterizing the nature of domains ranging from ill-defined to well-defined ones. Our study also makes evident the need for maintenance approaches that go beyond case base and competence to include other containers and performance objectives.
Holographic Case-Based Reasoning is a framework developed to build cognitively appealing case-based reasoners with proactive and interconnected cases. Improved realizations of the Holographic CBR framework are developed using the principles of dynamic memory proposed by Roger Schank and tested on their cognitive appeal, efficiency, and solution quality compared to other relevant systems.
In a path-breaking work, Kahneman characterized human cognition as a result of two modes of operation, Fast Thinking and Slow Thinking. Fast thinking involves quick, intuitive decision making and slow thinking is deliberative conscious reasoning. In this paper, for the first time, we draw parallels between this dichotomous model of human cognition and decision making in Case-based Reasoning (CBR). We observe that fast thinking can be operationalized computationally as the fast decision making by a trained machine learning model, or a parsimonious CBR system that uses few attributes. On the other hand, a full-fledged CBR system may be seen as similar to the slow thinking process. We operationalize such computational models of fast and slow thinking and switching strategies, as Models 1 and 2. Further, we explore the adaptation process in CBR as a slow thinking manifestation, leading to Model 3. Through an extensive set of experiments on real-world datasets, we show that such realizations of fast and slow thinking are useful in practice, leading to improved accuracies in decision-making tasks.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.