Fluency Boost Learning and Inference for Neural Grammatical Error Correction

Ge, Tao; Wei, Furu; Zhou, Ming

doi:10.18653/v1/p18-1097

Cited by 107 publications

(94 citation statements)

References 40 publications


“…Many recent advances in neural GEC aim at overcoming the mentioned data sparsity problem. Ge et al (2018a) proposed fluency-boost learning that generates additional training examples during training from an independent backward model or the forward model being trained. Xie et al (2018) supplied their model with noisy examples synthesized from clean sentences.…”

Section: Related Work (mentioning)

confidence: 99%


Neural Grammatical Error Correction Systems with Unsupervised Pre-training on Synthetic Data

Grundkiewicz, Junczys-Dowmunt, Heafield

2019

Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications

Considerable effort has been made to address the data sparsity problem in neural grammatical error correction. In this work, we propose a simple and surprisingly effective unsupervised synthetic error generation method based on confusion sets extracted from a spellchecker to increase the amount of training data. Synthetic data is used to pre-train a Transformer sequence-to-sequence model, which not only improves over a strong baseline trained on authentic error-annotated data, but also enables the development of a practical GEC system in a scenario where little genuine error-annotated data is available. The developed systems placed first in the BEA19 shared task, achieving 69.47 and 64.24 F0.5 in the restricted and low-resource tracks respectively, both on the W&I+LOCNESS test set. On the popular CoNLL 2014 test set, we report state-of-the-art results of 64.16 M2 for the submitted system, and 61.30 M2 for the constrained system trained on the NUCLE and Lang-8 data.
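The confusion-set idea in the abstract above can be illustrated with a minimal sketch. It is not the authors' implementation: the confusion sets here are hand-written placeholders (the abstract says they are extracted from a spellchecker), and the names `CONFUSION_SETS`, `corrupt`, `make_pairs` and the `error_rate` parameter are hypothetical.

```python
import random

# Hypothetical confusion sets. The abstract derives them from a spellchecker;
# these few entries are placeholders for illustration only.
CONFUSION_SETS = {
    "their": ["there", "they're"],
    "than": ["then"],
    "has": ["have", "had"],
    "a": ["an", "the"],
}


def corrupt(sentence, error_rate=0.15, rng=None):
    """Replace some tokens with a member of their confusion set."""
    rng = rng or random.Random()
    noisy = []
    for token in sentence.split():
        candidates = CONFUSION_SETS.get(token.lower())
        # Only tokens that have a confusion set can be corrupted.
        if candidates and rng.random() < error_rate:
            noisy.append(rng.choice(candidates))
        else:
            noisy.append(token)
    return " ".join(noisy)


def make_pairs(clean_sentences, error_rate=0.15, seed=0):
    """Build (noisy source, clean target) pairs for pre-training."""
    rng = random.Random(seed)
    return [(corrupt(s, error_rate, rng), s) for s in clean_sentences]
```

Calling `make_pairs` on a list of clean sentences yields source/target pairs in which the clean sentence serves as the correction target. The assumed `error_rate` controls how many eligible tokens are swapped.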



“…Other recent work focuses on improving model inference. Ge et al (2018a) proposed correcting a sentence more than once through multi-round model inference. Lichtarge et al (2018) introduced iterative decoding to incrementally correct a sentence with a high-precision system.…”

Section: Related Work (mentioning)

confidence: 99%
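The multi-round inference mentioned in the statement above can be sketched as a loop that re-applies a corrector to its own output. This is only an illustration under stated assumptions: the stopping rule (stop when the output no longer changes, or after `max_rounds`) is an assumption, and `multi_round_correct` and `toy_corrector` are hypothetical names, with the toy corrector standing in for a trained seq2seq model.

```python
def multi_round_correct(sentence, correct_once, max_rounds=5):
    """Apply a single-pass corrector repeatedly.

    Assumed stopping rule: stop when a round leaves the sentence
    unchanged, or after max_rounds passes.
    """
    current = sentence
    for _ in range(max_rounds):
        revised = correct_once(current)
        if revised == current:
            break
        current = revised
    return current


def toy_corrector(sentence):
    """Stand-in for a model: fixes at most one error per pass."""
    rules = [("She go ", "She goes "), (" a apple", " an apple")]
    for wrong, right in rules:
        if wrong in sentence:
            return sentence.replace(wrong, right, 1)
    return sentence


print(multi_round_correct("She go to eat a apple", toy_corrector))
```

Because the toy corrector fixes one error per pass, a single round leaves the second error in place, while the loop resolves both.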

Neural Grammatical Error Correction Systems with Unsupervised Pre-training on Synthetic Data

Grundkiewicz, Junczys-Dowmunt, Heafield

2019

Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications

“…• Word2vec. Recent works on learnable evaluation metrics use simple word embeddings such as word2vec and GloVe as input to their models (Tao et al, 2018; Lowe et al, 2017; Kannan and Vinyals, 2017). Since these static embeddings have a fixed context-independent representation for each word, they cannot represent the rich semantics of words in contexts.…”

Section: Word Embeddings (mentioning)

confidence: 99%

“…The Referenced metric and Unreferenced metric Blended Evaluation Routine (RUBER) (Tao et al, 2018) stands out from recent work in automatic dialogue evaluation, relying minimally on human-annotated datasets of response quality for training. RUBER evaluates responses with a blending of scores from two metrics: • an Unreferenced metric, which computes the relevancy of a response to a given query inspired by Grice (1975)'s theory that the quality of a response is determined by its relatedness and appropriateness, among other properties.…”

Section: Introduction (mentioning)

confidence: 99%
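The statement above says that RUBER blends the scores of two metrics. A minimal sketch of such a blend follows. The set of strategies offered here (`min`, `max`, `mean`), the assumption that both scores lie in [0, 1], and the function name `blend` are illustrative choices, not details taken from the quoted text.

```python
def blend(referenced, unreferenced, strategy="mean"):
    """Combine a referenced and an unreferenced score into one value.

    Assumes both inputs are already normalised to [0, 1]; the list of
    strategies is illustrative.
    """
    strategies = {
        "min": min,
        "max": max,
        "mean": lambda a, b: (a + b) / 2,
    }
    if strategy not in strategies:
        raise ValueError(f"unknown blending strategy: {strategy}")
    return strategies[strategy](referenced, unreferenced)


for name in ("min", "max", "mean"):
    print(name, blend(0.2, 0.8, name))
```

Taking the minimum rewards only responses that both metrics rate well, whereas the mean lets one metric partly compensate for the other.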

Proceedings of the Workshop on Methods for Optimizing and Evaluating Neural Language Generation

2019


In this paper, we extend the persona-based sequence-to-sequence (Seq2Seq) neural network conversation model to a multi-turn dialogue scenario by modifying the state-of-the-art hredGAN architecture to simultaneously capture utterance attributes such as speaker identity, dialogue topic, speaker sentiments and so on. The proposed system, phredGAN has a persona-based HRED generator (PHRED) and a conditional discriminator. We also explore two approaches to accomplish the conditional discriminator: (1) phredGAN_a, a system that passes the attribute representation as an additional input into a traditional adversarial discriminator, and (2) phredGAN_d, a dual discriminator system which in addition to the adversarial discriminator, collaboratively predicts the attribute(s) that generated the input utterance. To demonstrate the superior performance of phredGAN over the persona Seq2Seq model, we experiment with two conversational datasets, the Ubuntu Dialogue Corpus (UDC) and TV series transcripts from the Big Bang Theory and Friends. Performance comparison is made with respect to a variety of quantitative measures as well as crowd-sourced human evaluation. We also explore the trade-offs from using either variant of phredGAN on datasets with many but weak attribute modalities (such as with Big Bang Theory and Friends) and ones with few but strong attribute modalities (customer-agent interactions in Ubuntu dataset).


“…(Ng et al, 2013, 2014). In the past few years, both GEC-tuned statistical machine translation (SMT) and neural machine translation (NMT) using sequence-to-sequence (seq2seq) learning have demonstrated to be more effective in grammatical error correction than other approaches Ng, 2017, 2018; Ge et al, 2018; Zhao et al, 2019).…”

Section: Introduction (mentioning)

confidence: 99%

Improving Precision of Grammatical Error Correction with a Cheat Sheet

Qiu, Chen, Liu et al.

2019

Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications

In this paper, we explore two approaches of generating error-focused phrases and examine whether these phrases can lead to better performance in grammatical error correction for the restricted track of BEA 2019 Shared Task on GEC. Our results show that phrases directly extracted from GEC corpora outperform phrases from a statistical machine translation phrase table by a large margin. Appending error+context phrases to the original GEC corpora yields comparably higher precision. We also explore the generation of artificial syntactic error sentences using error+context phrases for the unrestricted track. The additional training data greatly facilitates syntactic error correction (e.g., verb form) and contributes to better overall performance.
