In the medical field, gene selection is critical, and it has the ability to diagnose diseases at an early stage. Data imbalance and poor feature selection performance are limitations in current techniques. Hyperband optimization is proposed in this paper to increase the performance of the XGBoost classifier. The NCBI gene dataset is utilised to evaluate the developed technique's performance in gene selection. The normalization procedure is used to scale the input data and decrease data discrepancies. When the Principal Component Analysis (PCA) method is used on input data to choose important features for classification, the independence variable becomes more difficult to interpret. To execute the gene selection for disease diagnosis, the selected features are applied to the XGBoost classifier. The hyperband optimization method searches in a distributed fashion to increase parameter exploration. The accuracy of the XGB-PCA-HO approach is 97.06%, XGB is 88.24%, and Random Forest is 85.29%.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.