With the vast growth of technology, the world is moving towards different style of instant food habits which lead to the irregular functioning of the body organs. One such victim problem we face is the existence of hypothyroid in the body. Hypothyroid is the under active thyroid circumstance, where the thyroid gland does not produce required amount of essential hormones. The prediction of hypothyroid still remains as a challenging task due to the non availability of exact symptoms. By keeping this analysis in mind, this paper focus on prediction of hypothyroid based on the clinical parameters. The hypothyroid dataset from the UCI machine learning repository is used for predicting the existence of hypothyroid using machine learning classification algorithms. The prediction of existence of hypothyroid is carried out in four ways. Firstly, the raw data set is fitted with various classification algorithms to find the existence of hypothyroid. Secondly, the data set is tailored by the Ada Boost Regressor algorithm to extract the important features from the hypothyroid dataset. Then the extracted feature importance of the hypothyroid dataset is then fitted to the various classification algorithms. Thirdly, the hypothyroid dataset is subjected to the dimensionality reduction using principal component analysis. The PCA reduced hypothyroid dataset is then fitted with classification algorithms to predict the existence of hypothyroid. Fourth, the performance analysis is done for the raw data set, Feature importance AdaBoost hypothyroid dataset and PCA reduced hypothyroid dataset by comparing the performance metrics like precision, recall, FScore and Accuracy. This paper is implemented by python scripts in Anaconda Spyder Navigator. Experimental Result shows that the Random Forest, Naive Bayes and Logistic regression have the accuracy of 99.5 for the raw dataset, feature importance reduced dataset and the accuracy of 99.8 for the five component reduced PCA dataset.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.