2012
DOI: 10.1002/sres.2130
|View full text |Cite
|
Sign up to set email alerts
|

Predicting Students' Progression in Higher Education by Using the Random Forest Algorithm

Abstract: This paper proposes the use of data available at Manchester Metropolitan University to assess the variables that can best predict student progression. We combine virtual learning environment (VLE) and management information systems student records datasets and apply the Random Forest (RF) algorithm to ascertain which variables can best predict students' progression. RF was deemed useful in this case because of the large amount of data available for analysis. The paper reports on the initial findings for data a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
10
0
1

Year Published

2014
2014
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 33 publications
(11 citation statements)
references
References 7 publications
0
10
0
1
Order By: Relevance
“…We used the machine learning technique 'Random Forest' (Breiman 2001) to identify the importance for specific survey instrument questions (Q1 through Q21) in predicting student 'Overall Satisfaction' (Q22). Machine learning methods are increasingly being used to analyse complex, large datasets in diverse fields of study, including psychology (Strobl et al 2009), ecology (Prasad et al 2006) and higher education research (Langan et al 2016;Hardman et al 2013). Random Forest analysis ultimately makes predictions based on variable associations, ranking the predictive importance of each variable (Grömping 2009;Genuer et al 2010).…”
Section: Machine Learning Analysismentioning
confidence: 99%
“…We used the machine learning technique 'Random Forest' (Breiman 2001) to identify the importance for specific survey instrument questions (Q1 through Q21) in predicting student 'Overall Satisfaction' (Q22). Machine learning methods are increasingly being used to analyse complex, large datasets in diverse fields of study, including psychology (Strobl et al 2009), ecology (Prasad et al 2006) and higher education research (Langan et al 2016;Hardman et al 2013). Random Forest analysis ultimately makes predictions based on variable associations, ranking the predictive importance of each variable (Grömping 2009;Genuer et al 2010).…”
Section: Machine Learning Analysismentioning
confidence: 99%
“…Hardmann et al . () highlighted the usefulness of random forest algorithm, a technique initially developed in biology, in evaluating an educational and learning environment. As researchers indicate that the education system represents an extremely complicated system that involves many aspects; thus, by using systems science methods (Lin et al .…”
Section: Discussionmentioning
confidence: 99%
“…Ho (2013) indicated recently that system concepts including cognitive system concepts, soft systems methodology, and soft systems thinking can be applied to e-learning systems. Hardmann et al (2013) highlighted the usefulness of random forest algorithm, a technique initially developed in biology, in evaluating an educational and learning environment. As researchers indicate that the education system represents an extremely complicated system that involves many aspects; thus, by using systems science methods (Lin et al 2013), we can analyse the complexity involved (Jackson, 1995).…”
Section: Discussionmentioning
confidence: 99%
“…These strategies were applied to select the Base-learners in the first layer. Random forest (RF) [23][24][25], SVM [26][27][28], and AdaBoost [29,30] classifiers were chosen in this study to be the Base-learners due to their better classification performances. They are commonly adopted as predictive models for student achievement prediction at present and have good performance.…”
Section: The Proposed Ensemble Modelmentioning
confidence: 99%