2022
DOI: 10.3390/educsci13010017
|View full text |Cite
|
Sign up to set email alerts
|

Predicting Student Performance Using Clickstream Data and Machine Learning

Abstract: Student performance predictive analysis has played a vital role in education in recent years. It allows for the understanding students’ learning behaviours, the identification of at-risk students, and the development of insights into teaching and learning improvement. Recently, many researchers have used data collected from Learning Management Systems to predict student performance. This study investigates the potential of clickstream data for this purpose. A total of 5341 sample students and their click behav… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
6
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 21 publications
(7 citation statements)
references
References 54 publications
1
6
0
Order By: Relevance
“…Consequently, based on all three performance measures and their rather low standard deviation (computed from the 10-model accuracy), it can be said that the deep learning model can robustly predict the final score of learners when they complete 30% to 70% of their solutions. Our findings are consistent with those reported by Liu et al [34], Hajra et al [35], and Baranyi et al [36] in that deep neural networks could outperform traditional machine learning techniques in prediction tasks of learner performance. While like the work reported by Chen and Cui [37] and Okubo et al [38], we found that longer sequences of learner's actions could improve the prediction power of deep learning models, our results show that they may not necessarily be able to provide the optimal predictive performance.…”
Section: Performance Of Models Using Cross-validation: Validation Phasesupporting
confidence: 93%
“…Consequently, based on all three performance measures and their rather low standard deviation (computed from the 10-model accuracy), it can be said that the deep learning model can robustly predict the final score of learners when they complete 30% to 70% of their solutions. Our findings are consistent with those reported by Liu et al [34], Hajra et al [35], and Baranyi et al [36] in that deep neural networks could outperform traditional machine learning techniques in prediction tasks of learner performance. While like the work reported by Chen and Cui [37] and Okubo et al [38], we found that longer sequences of learner's actions could improve the prediction power of deep learning models, our results show that they may not necessarily be able to provide the optimal predictive performance.…”
Section: Performance Of Models Using Cross-validation: Validation Phasesupporting
confidence: 93%
“…This means that the model depends on the information available at that time to predict the student performance after a specific length of time, such as at the start of a course or semester. The CNN-LSTM architectures Aljaloud et al (2022); Chen et al (2022); Duru et al (2021); Li et al (2022); Liu et al (2023); Xiong et al (2022) have achieved a high accuracy of more than 70.25% and up to 91%, and on average 85.89%, except in Song et al (2020) which has achieved an accuracy of about 61% (See Figure 7).…”
Section: Hybrid DL Modelsmentioning
confidence: 99%
“…To predict student's success on the final exam, Sikder et al (2022) A study by Liu et al (2023) examined how clickstream data can be used to predict student performance. The study utilized the most important indicators of how well students would perform from the OULA dataset with weekly and monthly time intervals; these indicators included clicks on the homepage, related sites, quizzes, and content.…”
Section: Hybrid DL Techniquesmentioning
confidence: 99%
“…When all training examples in the terminal/leaf node t within the same class, then the stopping criterion is said to be reached (Junshuai, 2019) Every node matches a characteristic, while the branches link with an array of values. All nodes are labelled with the attributes they test, and every branch is labelled with its corresponding values (Liu et al, 2023). The range of values is mutually exclusive and complete.…”
Section: Decision Treesmentioning
confidence: 99%