The thyroid gland is a crucial organ in the human body, secreting two hormones that help regulate the body's metabolism. Thyroid disease is a severe medical condition that can arise from high Thyroid Stimulating Hormone (TSH) levels or from an infection of the thyroid tissue. Hypothyroidism and hyperthyroidism are two critical conditions caused by insufficient and excessive thyroid hormone production, respectively. Machine learning models can process the data generated across different medical sectors with precision and can be used to build models that predict several diseases. In this paper, we used different machine learning algorithms to predict hypothyroidism and hyperthyroidism. We also identified the most significant features, which can be used to detect thyroid diseases more precisely. After completing the pre-processing and feature selection steps, we applied both our modified and original data to several classification models to predict thyroid disease. On our dataset, Random Forest (RF) achieved the highest score on every evaluation metric, while Naive Bayes performed very poorly. With features selected by the feature importance method, RF achieved the best accuracy of 91.42%, with a precision of 92%, a recall of 92%, and an F1-score of 92%. Further, by analyzing the characteristics and behavior of the dataset, we identified its most important features: TSH, T3, TT4, and FTI. In terms of accuracy and the other performance evaluation criteria, this study supports the use of effective machine learning classifiers and features to detect and diagnose thyroid disease. Finally, we performed an explainability analysis of our best classifier to understand the inner workings of the otherwise black-box model and of the dataset. This study could further pave the way for researchers and healthcare professionals to analyze thyroid disease in real-time applications.
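The four evaluation criteria named above (accuracy, precision, recall, and F1-score) can be illustrated with a minimal sketch. The labels and predictions below are invented toy values, not data or results from the study, and macro-averaging across the classes is an assumption; the abstract does not state which averaging scheme was used. The function names `per_class_scores` and `evaluate` are hypothetical.

```python
def per_class_scores(y_true, y_pred, label):
    """Precision, recall, and F1 for one class, treating it as 'positive'."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == label and p == label)
    fp = sum(1 for t, p in pairs if t != label and p == label)
    fn = sum(1 for t, p in pairs if t == label and p != label)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    # F1 is the harmonic mean of precision and recall.
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


def evaluate(y_true, y_pred):
    """Accuracy plus macro-averaged precision, recall, and F1 (assumed scheme)."""
    labels = sorted(set(y_true) | set(y_pred))
    accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
    scores = [per_class_scores(y_true, y_pred, label) for label in labels]
    # Macro average: unweighted mean of the per-class scores.
    precision, recall, f1 = (
        sum(s[i] for s in scores) / len(labels) for i in range(3)
    )
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Toy three-class example (hypothetical values for illustration only).
y_true = ["hypo", "hypo", "hyper", "hyper",
          "normal", "normal", "normal", "normal"]
y_pred = ["hypo", "normal", "hyper", "hyper",
          "normal", "normal", "hyper", "normal"]

result = evaluate(y_true, y_pred)
for name, value in result.items():
    print(f"{name}: {value:.4f}")
```

Because the macro average weights every class equally, a classifier that neglects a rare class (for example, hyperthyroid cases) is penalized more than it would be under plain accuracy, which is one reason to report all four scores together.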