Proceedings of the Fifteenth Workshop on Innovative Use of NLP for Building Educational Applications 2020
DOI: 10.18653/v1/2020.bea-1.15

Should You Fine-Tune BERT for Automated Essay Scoring?

Abstract: Most natural language processing research now recommends large Transformer-based models with fine-tuning for supervised classification tasks; older strategies like bag-of-words features and linear models have fallen out of favor. Here we investigate whether, in automated essay scoring (AES) research, deep neural models are an appropriate technological choice. We find that fine-tuning BERT produces similar performance to classical models at significant additional cost. We argue that while state-of-the-art strate…
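For context, a minimal sketch of the kind of classical baseline the abstract contrasts with fine-tuned BERT: bag-of-words features fed to a linear model. This is an illustrative scikit-learn pipeline, not the paper's actual setup; the file name and column names are assumptions.

```python
# Classical AES baseline sketch: TF-IDF bag-of-words features + a linear (ridge) regressor.
# The CSV path and column names ("text", "score") are illustrative assumptions.
import pandas as pd
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import Ridge
from sklearn.pipeline import make_pipeline

essays = pd.read_csv("essays.csv")                   # assumed columns: "text", "score"
train = essays.sample(frac=0.8, random_state=0)
test = essays.drop(train.index)

model = make_pipeline(
    TfidfVectorizer(ngram_range=(1, 2), min_df=2),   # unigram/bigram bag-of-words features
    Ridge(alpha=1.0),                                # linear regressor over sparse features
)
model.fit(train["text"], train["score"])
predictions = model.predict(test["text"]).round()    # essay scores are typically integers
```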

Cited by 82 publications (70 citation statements)
References 50 publications
“…Although the approach of [Fonseca et al. 2018] achieved better results in both metrics for each competence (C1 to C5), these results are not adequate for summative student assessment, since in the AES field QWK values between 0.6 and 0.8 are usually used as a floor for testing purposes [Mayfield and Black 2020]. Furthermore, the method of [Fonseca et al. 2018], which achieved 75.20% in the QWK metric on their corpus, reached only 51% on the Essay-BR.…”
Section: Experiments and Results (mentioning)
confidence: 92%
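For reference, the quadratic weighted kappa (QWK) values behind the 0.6-0.8 threshold mentioned above can be computed with scikit-learn's quadratically weighted Cohen's kappa. The score vectors below are made-up illustrations, not data from either cited paper.

```python
# QWK: agreement between human and model scores with quadratic weights.
# The example scores are invented purely to show the call.
from sklearn.metrics import cohen_kappa_score

human_scores = [2, 3, 4, 4, 1, 3, 2, 5]
model_scores = [2, 3, 3, 4, 2, 3, 2, 4]

qwk = cohen_kappa_score(human_scores, model_scores, weights="quadratic")
print(f"QWK = {qwk:.3f}")   # values of roughly 0.6-0.8 are often treated as a floor for testing use
```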
“…Then, the embedding representation w_t corresponding to w_t is calculable as a dot product w_t = A · w_t.
- RNN-based models (Taghipour and Ng 2016; Alikaniotis et al 2016)
- Hierarchical representation models (Dong and Zhang 2016; Dong et al 2017)
- Coherence models (Tay et al 2018; Li et al 2018; Farag et al 2018; Mesgar and Strube 2018; Yang and Zhong 2021)
- BERT-based models (Nadeem et al 2019; Rodriguez et al 2019; Yang et al 2020; Mayfield and Black 2020)
- Hybrid models (Dasgupta et al 2018)
- Robust model (Uto and Okano 2020)…”
Section: RNN-based Model (mentioning)
confidence: 99%
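The quoted embedding step amounts to selecting a column of an embedding matrix A with a one-hot word vector. A toy NumPy sketch of that product follows; the dimensions and vocabulary size are assumptions, not values from the survey.

```python
# One-hot word vector w_t multiplied by embedding matrix A gives the dense
# representation w_t = A · w_t used by RNN-based scorers. Toy dimensions only.
import numpy as np

vocab_size, embed_dim = 10, 4
rng = np.random.default_rng(0)
A = rng.normal(size=(embed_dim, vocab_size))    # embedding matrix, one column per vocabulary word

word_index = 7
w_onehot = np.zeros(vocab_size)
w_onehot[word_index] = 1.0

w_embedded = A @ w_onehot                       # equivalent to selecting column 7 of A
assert np.allclose(w_embedded, A[:, word_index])
```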
“…Similarly, [30] proposed a BERT architecture for the AES task. The authors utilized the pretrained BERT embeddings and then applied fine-tuning.…”
Section: A Supervised AES (mentioning)
confidence: 99%
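A hedged sketch of that general recipe (a pretrained BERT encoder with a fine-tuned scoring head) using Hugging Face Transformers is shown below. This is not the cited authors' implementation; the model checkpoint, score scale, and hyperparameters are illustrative assumptions.

```python
# Sketch of "pretrained BERT + fine-tuning" for essay scoring as a regression task.
# Checkpoint, learning rate, essay text, and gold score are all illustrative.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=1   # single regression head producing one score
)
optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5)

essay = "Technology has changed the way students learn ..."
target = torch.tensor([[3.0]])          # assumed gold score on some rubric scale

inputs = tokenizer(essay, truncation=True, max_length=512, return_tensors="pt")
output = model(**inputs, labels=target)  # MSE loss is used when num_labels == 1
output.loss.backward()
optimizer.step()                         # one fine-tuning step on a single essay
```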