Early and accurately predicting the students' dropout enables schools to recognize the students based on available educational data. The early student dropout prediction is a major concern of education administrators. The existing classification techniques were unable to handle the early stage accurate performance of student dropout prediction with maximum accuracy and minimum time. In order to resolve the issue, a novel technique called rank biserial Otsuka-Ochiai stochastic embedded feature selection based bivariate kernelized regressive bootstrap aggregative classifier (RBOOSEFS-BKBAC) is motivated to perform student dropout prediction. The aim of the designing RBOOSEFS-BKBAC is to improve student dropout accuracy and minimal time consumption. Initially, the data preprocessing is to perform the data normalization, data cleaning, and duplicate data removal. Next, rank biserial correlation is used for discovering the correlated features. Followed by, Otsuka-Ochiai stochastic neighbor embedded feature selection is carried out to select significant features. Finally, bivariate kernelized regressive bootstrap aggregative classification technique is to perform classification with help of weak classifier. By using Bucklin voting scheme, the classification outcomes are obtained for increasing prediction accuracy as well as minimizing error. Experimental evaluation is performed by using Student-Drop-India2016 dataset with different metrics such as prediction accuracy, precision, recall, F-measure, as well as time. The result of proposed RBOOSEFS-BKBAC technique is provided that the higher prediction accuracy by 5% and lesser the prediction time by 15%, as compared to the state-of-the-art methods. K E Y W O R D Sbivariate kernelized regressive bootstrap aggregative classification, Otsuka-Ochiai stochastic neighbor embedded feature selection, preprocessing, rank biserial correlation-based feature extraction, school student dropout prediction INTRODUCTIONStudents' dropout is a difficult issue in the learning process for both the individual and society. Early prediction is a main challenge in the dropout prediction. Predicting the student's at-risk of dropout offers various benefits to educators, students, and the educational institution. Machine learning algorithms were applied in education to address the predicting dropout of students issue from schools. So that potential drop outs are documented
Prediction of student dropout in high school is a significant concern in education that affects both a state's education system and its financial system. Early prediction of school student dropout is not an easy issue to resolve since many factors that can influence student retention. The traditional classification techniques were used to solve this problem normally but the higher accuracy was not obtained. In order to improve the accuracy, a novel technique called Tucker's Congruence Regressive Target Feature Matching‐based Tversky Discriminant MIL Boost Data Classification (TCRTFM‐TDMBDC) is introduced. The proposed TCRTFM‐TDMBDC technique consists of four different processes namely data preprocessing, feature extraction, feature selection, and classification. At first, the data preprocessing is carried out for cleaning and altering the raw input data into a valuable and understandable format to minimize the complexity of the classification. After the preprocessing, the feature extraction is carried out by applying Modified Tucker's congruence correlative regression. Thirdly, the feature selection process is performed using Gaussian kernelized target projection feature matching to select the feature subset for accurate classification with minimum time consumption. Finally, the ensemble technique called Tversky Indxive generalized discriminant MIL boost is applied for classifying the given input student data with help of the weak learners. Based on the classification results, the student dropout prediction is accurately performed with minimum time. Experimental results reveal that the proposed technique noticeably predicts student dropout by means of prediction accuracy, precision, recall, F‐measure, and prediction time with respect to the number of student data. The discussed results illustrate that the proposed TCRTFM‐TDMBDC technique achieves higher accuracy with minimum prediction time than the state‐of‐the‐art methods.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.