With the exponential growth in web content and due to its sheer volume, the answers provided by traditional search engines by query specific keywords to content has resulted in markedly high recall and low precision. In order to alleviate this problem, the notion of incorporating semantics in content and in Search Engines, i.e., a Semantic Search Engine is increasingly crucial. Several Semantic Search Engines (SSEs) have been proposed and deployed till date and each is inherently different from the other. As such, the objective of this paper is to present a discussion on semantically enhanced search engines for intelligent web content discovery. We also present the architecture of a new SSE based on a bottom up approach that focuses on building a semantic base for Web content first and then carry out the process of querying it for attaining high precision and lower recall.
Cardiovascular disease (CVD) is a severe public health concern globally. Early and accurate CVD diagnosis is a difficult task but a necessary endeavour required to prevent further damage and protect patients’ lives. Machine Learning (ML)-based Clinical Decision Support Systems (CDSS) have the potential to assist healthcare providers in making accurate CVD diagnoses and treatments. Clinical data usually contains missing values (MVs); hence, the incorporated imputation techniques for ML have become a critical consideration when working with real-world medical datasets. Furthermore, removing instances with MVs will lead to essential data loss and produce incorrect results. To overcome these issues, this paper proposes an efficient and reliable CDSS with Ensemble Two-Fold Classification (ETC) framework for classifying heart diseases. The effectiveness of the proposed ETC framework using different supervised ML algorithms is evaluated with four distinct imputation methods for handling MVs over the standard benchmark dataset, viz., the University of California, Irwin (UCI). Experimental results show that our proposed ETC framework with the k-Nearest Neighbors(k-NN) imputation method achieves better classification accuracy of 0.9999 and a lesser error rate of 0.0989 compared to other imputation methods and classifiers with similar execution times.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.