2022
DOI: 10.21512/comtech.v13i1.7388
|View full text |Cite
|
Sign up to set email alerts
|

Prediction of Undergraduate Student’s Study Completion Status Using MissForest Imputation in Random Forest and XGBoost Models

Abstract: The number of higher education graduates in Indonesia is calculated based on their completion status. However, many undergraduate students have reached the maximum length of study, but their completion status is unknown. This condition becomes a problem in calculating the actual number of graduates as it is used as an indicator of higher education evaluation and other policy references. Therefore, the unknown completion status of the students who have reached the maximum length of study must be predicted. The … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
3
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(7 citation statements)
references
References 20 publications
1
3
0
Order By: Relevance
“…The present study may conclude that the proposed model achieved the highest accuracy, precision, and recall for predicting poor-performing students. The present study shows that the ensemble model performs better than the single base classifier and is consistent with the findings of previous researchers [28]- [30].…”
Section: Discussionsupporting
confidence: 93%
“…The present study may conclude that the proposed model achieved the highest accuracy, precision, and recall for predicting poor-performing students. The present study shows that the ensemble model performs better than the single base classifier and is consistent with the findings of previous researchers [28]- [30].…”
Section: Discussionsupporting
confidence: 93%
“…Woo and Kim (2022) investigated the effect of learning orientation on the modelling stage based on gender via XGBoost model. Nirmala et al (2022) compared the performance of Random Forest and XGBoost models in predicting the completion status of the students reaching the maximum length of study.…”
Section: Discussionmentioning
confidence: 99%
“…The random Forest algorithm in this study was conducted on the default hyperparameter, that was, 𝑚𝑡𝑟𝑦 = √𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑝𝑟𝑒𝑑𝑖𝑐𝑡𝑜𝑟𝑠 = 5 and 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑡𝑟𝑒𝑒𝑠 = 100. The value of 𝑚𝑡𝑟𝑦 defined the number of predictors involved in the best splitting [1]. Table 6 shows the performance measures of the resulting random forest.…”
Section: Modeling the Empirical Datamentioning
confidence: 99%
“…Each decision tree in a random forest is constructed independently. The final random forest prediction for the classification case is based on aggregating predictions with the majority of votes from all decision trees [1]. Random forest is one of the most popular ensemble methods because random forests can be applied to various prediction problems and produce competitive accuracy.…”
Section: Introductionmentioning
confidence: 99%