Diabetes is one of the common and growing diseases in several countries and all of them are working to prevent this disease at early stage by predicting the symptoms of diabetes using several methods. The main aim of this study is to compare the performance of algorithms those are used to predict diabetes using data mining techniques. In this paper we compare machine learning classifiers (J48 Decision Tree, K-Nearest Neighbors, and Random Forest, Support Vector Machines) to classify patients with diabetes mellitus. These approaches have been tested with data samples downloaded from UCI machine learning data repository. The performances of the algorithms have been measured in both the cases i.e dataset with noisy data (before pre-processing) and dataset set without noisy data (after pre-processing) and compared in terms of Accuracy, Sensitivity, and Specificity.
This paper introduces the concept of repetitive group sampling (RGS) for variables inspection. The repetitive group sampling plan for variables inspection will be useful when testing is costly and destructive. The advantages of the variables RGS plan over variables single sampling plan, variables double sampling plan and attributes RGS plan are discussed. Tables are also constructed for the selection of parameters of known and unknown standard deviation variables repetitive group sampling plan indexed by acceptable quality level and limiting quality level.Acceptable quality level, average sample number, limiting quality level, repetitive group sampling, sampling by variables,
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.