Predicting software defects is a crucial part of the software development life cycle (SDLC). Recent years have seen many studies on software defect prediction, most of them based on machine learning techniques. Before defects can be predicted, pre-processing steps such as feature selection, outlier removal, and feature scaling are required, because they help improve accuracy and reduce execution time (ET). The present investigation focuses on feature selection, a dimensionality reduction technique. We also propose a framework for module-based software defect prediction using feature selection techniques. These techniques fall into three categories: filter methods, wrapper methods, and a hybrid method that we have developed, which combines two wrapper methods, Sequential Forward Selection and Sequential Backward Selection. Finally, classification is performed with KNN, Logistic Regression, Decision Tree, and SVM using these feature selection techniques on eight publicly available PROMISE datasets, and the results are compared with existing state-of-the-art (SOTA) methods. The results show that, when applied along with machine learning, the hybrid method improves accuracy over the filter and wrapper methods by 4.2%, 3.9%, and 3.8% on the pc4, jm1, and kc2 datasets, respectively.
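The abstract does not specify how Sequential Forward Selection and Sequential Backward Selection are combined, so the following is only a minimal sketch under one plausible assumption: grow a feature subset with forward selection, then prune it with backward elimination, scoring each candidate subset by leave-one-out KNN accuracy. All function names, the scoring choice, and the synthetic toy data are hypothetical illustrations, not the authors' implementation or a PROMISE dataset.

```python
import random

def knn_loo_accuracy(X, y, features, k=3):
    """Leave-one-out accuracy of k-NN using only the given feature indices."""
    if not features:
        return 0.0
    correct = 0
    for i, xi in enumerate(X):
        # Squared Euclidean distance to every other sample on the chosen features.
        dists = sorted(
            (sum((xi[f] - xj[f]) ** 2 for f in features), y[j])
            for j, xj in enumerate(X) if j != i
        )
        votes = [label for _, label in dists[:k]]
        pred = max(set(votes), key=votes.count)  # majority vote
        correct += pred == y[i]
    return correct / len(X)

def forward_selection(X, y, score):
    """Greedily add the feature that most improves the score; stop when none does."""
    selected, best = [], 0.0
    remaining = list(range(len(X[0])))
    while remaining:
        trial = {f: score(X, y, selected + [f]) for f in remaining}
        f_best = max(remaining, key=lambda f: trial[f])
        if trial[f_best] <= best:
            break
        selected.append(f_best)
        remaining.remove(f_best)
        best = trial[f_best]
    return selected, best

def backward_elimination(X, y, score, start):
    """Greedily drop features as long as the score does not get worse."""
    selected = list(start)
    best = score(X, y, selected)
    while len(selected) > 1:
        trial = {f: score(X, y, [g for g in selected if g != f]) for f in selected}
        f_drop = max(selected, key=lambda f: trial[f])
        if trial[f_drop] < best:
            break
        selected.remove(f_drop)
        best = trial[f_drop]
    return selected, best

def hybrid_selection(X, y, score=knn_loo_accuracy):
    """Assumed hybrid: forward selection followed by backward pruning."""
    grown, _ = forward_selection(X, y, score)
    return backward_elimination(X, y, score, grown)

# Synthetic toy data (an assumption for illustration only):
# feature 0 separates the two classes, features 1-4 are pure noise.
rng = random.Random(0)
y = [0] * 20 + [1] * 20
X = [[10 * label + rng.random()] + [rng.random() for _ in range(4)] for label in y]

selected, accuracy = hybrid_selection(X, y)
print(selected, accuracy)  # prints: [0] 1.0
```

On this toy data the forward pass already isolates the single informative feature, so the backward pass has nothing to prune. On real defect data with correlated metrics, the backward pass is where redundant features picked up early would be dropped.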