The lost impact on the research process, can be serious in classifying results leading to biased parameter estimates, statistical information, decreased quality, increased standard error, and weak generalization of the findings. In this paper, we discuss the problems that exist in one of the algorithms, namely the Naive Bayes Kernel algorithm. The Naive Bayes kernel algorithm has the disadvantage of not being able to process data with the mission value. Therefore, in order to process missing value data, there is one method that we propose to overcome, namely using the mean imputation method. The data we use is public data from UCI, namely the HCV (Hepatisis C Virus) dataset. The input method used to correct the missing data so that it can be filled with the average value of the existing data. Before the imputation process means, the dataset uses yahoo bootstrap first. The data that has been corrected using the mean imputation method has just been processed using the Naive Bayes Kernel Algorithm. From the results of the research tests that have been carried out, it can be obtained an accuracy value of 96.05% and the speed of the data computing process with 1 second.
Heart failure is a type of disease that has the largest number of patients in the world. Based on information from the data center, there were 229,696 people with heart failure in 2013. Lack of public knowledge about what indications of a person having heart failure make the main cause not handled properly by heart failure patients. In this study, data classification was carried out using KNN algorithm because it has a simple calculation and has a fast time. This study only uses 12 attributes, while the previous study compared 6 algorithms with 13 attributes from 299 data. The highest algorithm with 94.31% accuracy by Random Forest while KNN had an accuracy rate of 86.95% with the same data. In this study, the accuracy of the sample data was compared between 20 data and 299 total data. Both of them have different accuracy. 20 sample data has an accuracy rate of 89.29% while 299 data has an accuracy rate of 96.66%.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.