Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
DOI: 10.18653/v1/D17-1297
Neural Sequence-Labelling Models for Grammatical Error Correction

Abstract: We propose an approach to N-best list reranking using neural sequence-labelling models. We train a compositional model for error detection that calculates the probability of each token in a sentence being correct or incorrect, utilising the full sentence as context. Using the error detection model, we then re-rank the N best hypotheses generated by statistical machine translation systems. Our approach achieves state-of-the-art results on error correction for three different datasets, and it has the additional…
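To make the reranking idea in the abstract concrete, here is a minimal sketch, not the authors' implementation: it assumes a hypothetical `detect_correct_probs` callable standing in for the paper's sequence-labelling detector, and an assumed interpolation weight for combining the detection score with the SMT model score.

```python
# Minimal sketch of N-best reranking with a token-level error-detection
# model, in the spirit of the abstract. `detect_correct_probs` is a
# hypothetical stand-in for the paper's sequence-labelling model, and
# the interpolation weight is an assumption for illustration.

from typing import Callable, List, Tuple

def rerank_nbest(
    nbest: List[Tuple[str, float]],          # (hypothesis, SMT model score)
    detect_correct_probs: Callable[[List[str]], List[float]],
    weight: float = 0.5,                      # assumed interpolation weight
) -> List[Tuple[str, float]]:
    """Re-rank SMT hypotheses by mixing the SMT score with the average
    per-token probability of being correct under the detection model."""
    rescored = []
    for hyp, smt_score in nbest:
        tokens = hyp.split()
        probs = detect_correct_probs(tokens)  # P(token correct | sentence)
        detection_score = sum(probs) / max(len(probs), 1)
        combined = (1 - weight) * smt_score + weight * detection_score
        rescored.append((hyp, combined))
    return sorted(rescored, key=lambda x: x[1], reverse=True)
```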

Cited by 41 publications (37 citation statements: 0 supporting, 37 mentioning, 0 contrasting). Citing publications span 2018–2023. References 30 publications.

Citation statements (ordered by relevance):
“…Models can also be partially initialized by pre-training monolingual language models (Ramachandran et al., 2017) or only word-embeddings (Gangi and Federico, 2017). In GEC, Yannakoudakis et al. (2017) apply pretrained monolingual word-embeddings as initializations for error-detection models to re-rank SMT n-best lists. Approaches based on pre-training with monolingual data appear to be particularly well-suited to the GEC task.…”
Section: Transfer Learning for GEC (mentioning)
Confidence: 99%
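As a rough illustration of the initialization strategy this statement describes, the sketch below loads pretrained monolingual word vectors into an embedding matrix before training; the vocabulary structure, vector dimension, and file format are assumptions, not details from the cited papers.

```python
# Sketch: initializing an embedding matrix from pretrained monolingual
# word vectors. File format (one "word v1 v2 ..." entry per line) and
# all names here are assumed for illustration.

import numpy as np

def build_embedding_matrix(vocab: dict, vectors_path: str, dim: int = 300):
    """Return a |V| x dim matrix; rows for known words come from the
    pretrained vectors, unknown words get small random initializations."""
    rng = np.random.default_rng(0)
    matrix = rng.normal(scale=0.1, size=(len(vocab), dim))
    with open(vectors_path, encoding="utf-8") as f:
        for line in f:
            parts = line.rstrip().split(" ")
            word, values = parts[0], parts[1:]
            if word in vocab and len(values) == dim:
                matrix[vocab[word]] = np.asarray(values, dtype=np.float32)
    return matrix
```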
“…For the CoNLL 2014 benchmark on grammatical error correction, Junczys-Dowmunt and Grundkiewicz (2016) established a set of methods for GEC by SMT that remain state-of-the-art. Systems (Chollampatt and Ng, 2017; Yannakoudakis et al., 2017) that improve on results by Junczys-Dowmunt and Grundkiewicz (2016) use their set-up as a backbone for more complex systems.…”
Section: Introduction (mentioning)
Confidence: 99%
“…This improved the scores by about 0.3 F0.5 in CoNLL-2014 and FCE-test. Table 3: Our LM-based approach is compared against several state-of-the-art results. AMU16 SMT+LSTM and CAMB16 SMT+LSTM were both originally reported by Yannakoudakis et al. (2017), while Lee and Lee (2014) is the system entered by POST in CoNLL-2014. Only our approach does not use annotated training data.…”
Section: Results (mentioning)
Confidence: 99%
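For context on the 0.3-point improvement mentioned above: GEC systems are conventionally scored with F0.5, which weights precision twice as heavily as recall. The helper below computes it from edit-level precision and recall using the standard F-beta formula; it is not code from the cited work.

```python
def f_beta(precision: float, recall: float, beta: float = 0.5) -> float:
    """F-beta score; beta = 0.5 favours precision, as is standard in GEC."""
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)

# Example: precision 0.60, recall 0.30 -> F0.5 = 0.5
print(f_beta(0.60, 0.30))
```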
“…These approaches have since come to dominate the field, and a lot of recent research has focused on fine-tuning SMT systems (Junczys-Dowmunt and Grundkiewicz, 2016), reranking SMT output, combining SMT and classifier systems (Rozovskaya and Roth, 2016), and developing various neural architectures (Xie et al., 2016; Chollampatt and Ng, 2017; Yannakoudakis et al., 2017).…”
Section: Introduction (mentioning)
Confidence: 99%