Automatic Text Scoring Using Neural Networks

Alikaniotis, Dimitrios; Yannakoudakis, Helen; Rei, Marek

doi:10.18653/v1/p16-1068

Cited by 202 publications

(202 citation statements)

References 23 publications

Supporting

Mentioning

182

Contrasting

“…For example, Taghipour and Ng (2016) explore simple LSTM and CNN-based architectures with regression and evaluate on the ASAP-AES data. Alikaniotis et al (2016) train score-specific word embeddings with several LSTM architectures. Dong and Zhang (2016) demonstrate that a hierarchical CNN architecture produces strong results on the ASAP-AES data.…”

Section: Introduction (mentioning)

confidence: 99%

Investigating neural architectures for short answer scoring

Riordan, Horbach, Cahill et al. 2017

Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications

Neural approaches to automated essay scoring have recently shown state-of-the-art performance. The automated essay scoring task typically involves a broad notion of writing quality that encompasses content, grammar, organization, and conventions. This differs from the short answer content scoring task, which focuses on content accuracy. The inputs to neural essay scoring models, ngrams and embeddings, are arguably well-suited to evaluate content in short answer scoring tasks. We investigate how several basic neural approaches similar to those used for automated essay scoring perform on short answer scoring. We show that neural architectures can outperform a strong non-neural baseline, but performance and optimal parameter settings vary across the more diverse types of prompts typical of short answer scoring.

“…The main difference is that both earlier work treat the essay script as a sequence of words rather than a sequence of sentences. Alikaniotis et al (2016) use score-specific word embeddings as word features and take the last hidden state of LSTM as text representation. Taghipour and Ng (2016) take the average value over all the hidden states of LSTM as text representation.…”

Section: Text Representation (mentioning)

confidence: 99%

“…Recently, Alikaniotis et al (2016) employ a long short-term memory model to learn features for essay scoring task automatically without any predefined feature templates. It leverages score-specific word embeddings (SSWEs) for word representations, and takes the last hidden states of a two-layer bidirectional LSTM for essay representations.…”

Section: Prompt (mentioning)

confidence: 99%
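
The statements above describe two ways of turning a sequence of LSTM hidden states into a single text representation: keeping only the last hidden state (attributed to Alikaniotis et al, 2016) or averaging over all hidden states (attributed to Taghipour and Ng, 2016). The following is a minimal sketch of that difference only, not either paper's implementation. The function names and the toy hidden states are hypothetical, and the hidden states are plain lists standing in for the output of a real LSTM.

```python
from typing import List, Sequence

Vector = List[float]


def last_state(hidden_states: Sequence[Vector]) -> Vector:
    """Text representation = the final hidden state only."""
    if not hidden_states:
        raise ValueError("need at least one hidden state")
    return list(hidden_states[-1])


def mean_over_time(hidden_states: Sequence[Vector]) -> Vector:
    """Text representation = per-dimension average of all hidden states."""
    if not hidden_states:
        raise ValueError("need at least one hidden state")
    n = len(hidden_states)
    dim = len(hidden_states[0])
    return [sum(h[d] for h in hidden_states) / n for d in range(dim)]


# Toy hidden states for a three-word text (hypothetical values,
# standing in for what an LSTM would produce at each time step).
states = [[0.0, 1.0], [2.0, 3.0], [4.0, 8.0]]

print(last_state(states))      # [4.0, 8.0]
print(mean_over_time(states))  # [2.0, 4.0]
```

The last-state variant depends on the network carrying information from early words through to the final step, while the averaged variant gives every position equal weight.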

Attention-based Recurrent Convolutional Neural Network for Automatic Essay Scoring

Dong, Zhang, Yang 2017

Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017)

Neural network models have recently been applied to the task of automatic essay scoring, giving promising results. Existing work used recurrent neural networks and convolutional neural networks to model input essays, giving grades based on a single vector representation of the essay. On the other hand, the relative advantages of RNNs and CNNs have not been compared. In addition, different parts of the essay can contribute differently for scoring, which is not captured by existing models. We address these issues by building a hierarchical sentence-document model to represent essays, using the attention mechanism to automatically decide the relative weights of words and sentences. Results show that our model outperforms the previous state-of-the-art methods, demonstrating the effectiveness of the attention mechanism.

“…The features used in previous work range from shallow textual features to discourse structure and semantic coherence (Higgins et al, 2004; Yannakoudakis and Briscoe, 2012; Somasundaran et al, 2014), and from prompt independent to dependent features (Cummins et al, 2016a). Some recent models have dispensed with feature engineering and utilised word embeddings and neural networks (Alikaniotis et al, 2016; Dong and Zhang, 2016; Taghipour and Ng, 2016).…”

Section: Related Work (mentioning)

confidence: 99%

The Effect of Adding Authorship Knowledge in Automated Text Scoring

Zhang, Xie, Cummins et al. 2018

Proceedings of the Thirteenth Workshop on Innovative Use of NLP For Building Educational Applications

Some language exams have multiple writing tasks. When a learner writes multiple texts in a language exam, it is not surprising that the quality of these texts tends to be similar, and the existing automated text scoring (ATS) systems do not explicitly model this similarity. In this paper, we suggest that it could be useful to include the other texts written by this learner in the same exam as extra references in an ATS system. We propose various approaches of fusing information from multiple tasks and pass this authorship knowledge into our ATS model on six different datasets. We show that this can positively affect the model performance in most cases.
