Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Langua 2022
DOI: 10.18653/v1/2022.naacl-main.14
|View full text |Cite
|
Sign up to set email alerts
|

Do Deep Neural Nets Display Human-like Attention in Short Answer Scoring?

Abstract: Deep Learning (DL) techniques have been increasingly adopted for Automatic Text Scoring in education. However, these techniques often suffer from their inabilities to explain and justify how a prediction is made, which, unavoidably, decreases their trustworthiness and hinders educators from embracing them in practice. This study aimed to investigate whether (and to what extent) DL-based graders align with human graders regarding the important words they identify when marking short answer questions. To this end… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 27 publications
0
2
0
Order By: Relevance
“…ASAG research, spanning three decades (Burrows, Gurevych, and Stein 2015), remains aligned with current NLP developments, as many ASAG systems now employ deep learning techniques (Bonthu, Sree, and Prasad 2021; Haller et al 2022). Additionally, recent advancements include generalizable ASAG, wherein models are trained to generalize to target domains that do not overlap with their training domain (Zeng et al 2023). Aside from systems primarily centered on score prediction, other research directions encompass using machines as a second grader (Kulkarni et al 2014), resorting to human evaluators for more challenging questions (Li et al 2023), exploring adversarial attacks on grading systems (Filighera, Steuer, and Rensing 2020;Filighera et al 2022), and generating explainable predictions (Tornqvist et al 2023), among other areas of investigation.…”
Section: Related Workmentioning
confidence: 99%
“…ASAG research, spanning three decades (Burrows, Gurevych, and Stein 2015), remains aligned with current NLP developments, as many ASAG systems now employ deep learning techniques (Bonthu, Sree, and Prasad 2021; Haller et al 2022). Additionally, recent advancements include generalizable ASAG, wherein models are trained to generalize to target domains that do not overlap with their training domain (Zeng et al 2023). Aside from systems primarily centered on score prediction, other research directions encompass using machines as a second grader (Kulkarni et al 2014), resorting to human evaluators for more challenging questions (Li et al 2023), exploring adversarial attacks on grading systems (Filighera, Steuer, and Rensing 2020;Filighera et al 2022), and generating explainable predictions (Tornqvist et al 2023), among other areas of investigation.…”
Section: Related Workmentioning
confidence: 99%
“…These new tools have enabled researchers to probe black‐box systems by comparing machine and human explanations. For instance, Zeng et al (2022) used a dataset of 60 answers provided by US Grade 10 pupils in subjects like English and Science to compare human and machine annotations. The researchers analysed whether several high‐performing ML systems agreed with a sample of 20 academics on the most important words in the pupil responses that determined the final mark (which could go from zero to three).…”
Section: Barriers For Using Artificial Intelligence and Machine Learn...mentioning
confidence: 99%