Predicting reading comprehension from eye gaze data is a difficult task. We investigate the use of artificial neural networks (ANNs) to predict reading comprehension scores from eye gaze collected from participants who read and completed an online tutorial in our lab. Problems such as large feature sets and small highly imbalanced data sets compound to make this task even more complex. We propose using fuzzy output error (FOE) as an alternative performance function to mean square error (MSE) for training feed-forward neural networks to overcome these problems. We show that the use of FOE as the performance function for training ANNs provides significantly better classification of eye movements to reading comprehension scores. ANNs with three hidden layers of neurons gave the best classification results especially when FOE is used as the performance function for training. In these cases we found up to 50% reduction in misclassification rates compared to using MSE. We found that ANNs give optimal classification results in comparison to other classification techniques. When FOE is used as the performance function for training the ANNs the misclassification rates are halved compared to the other techniques. Cluster analysis was performed on one of the more complex data sets. Interesting reading behaviour properties were found within the data set. The intended use of this research is in the design of adaptive online learning environments that use eye gaze to predict user comprehension from reading behavior.
Imbalanced data sets are common in real life and can have a negative effect on classifier performance. We propose using fuzzy output error (FOE) as an alternative performance function to mean square error (MSE) for training feed forward neural networks to overcome this problem. The imbalanced data sets we use are eye gaze data recorded from reading and answering a tutorial and quiz. The goal is to predict the quiz scores for each tutorial page. We show that the use of FOE as the performance function for training neural networks provides significantly better classification of eye movements to reading comprehension scores. A neural network with three hidden layers of neurons gave the best classification results especially when FOE was used as the performance function for training. In these cases, upwards of a 19% reduction in misclassification was achieved compared to using MSE as the performance function.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.