2022
DOI: 10.48550/arxiv.2203.13064
Preprint

Ensembling and Knowledge Distilling of Large Sequence Taggers for Grammatical Error Correction

Abstract: In this paper, we investigate improvements to the GEC sequence tagging architecture with a focus on ensembling of recent cutting-edge Transformer-based encoders in Large configurations. We encourage ensembling models by majority votes on span-level edits because this approach is tolerant to the model architecture and vocabulary size. Our best ensemble achieves a new SOTA result with an F0.5 score of 76.05 on BEA-2019 (test), even without pretraining on synthetic datasets. In addition, we perform knowledge dis…
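The ensembling idea summarized in the abstract, majority voting over span-level edits, can be illustrated with a minimal sketch. The (start, end, replacement) edit representation and the strict-majority default threshold below are illustrative assumptions, not the authors' exact implementation.

```python
# Minimal sketch of span-level majority-vote ensembling for GEC.
# Assumption: each model's output is a list of (start, end, replacement)
# spans over the source tokens; the paper's actual edit format may differ.
from collections import Counter
from typing import List, Optional, Tuple

Edit = Tuple[int, int, str]  # (start token index, end token index, replacement text)

def majority_vote(edit_sets: List[List[Edit]], min_votes: Optional[int] = None) -> List[Edit]:
    """Keep edits proposed by at least `min_votes` of the ensembled models.

    Defaults to a strict majority. Because voting happens on span-level
    edits rather than on tag vocabularies, models with different
    architectures or vocabularies can be combined.
    """
    if min_votes is None:
        min_votes = len(edit_sets) // 2 + 1
    # Count each distinct edit once per model, then keep those with enough votes.
    counts = Counter(edit for edits in edit_sets for edit in set(edits))
    return sorted(e for e, c in counts.items() if c >= min_votes)

# Hypothetical example: three models correct "He go to school yesterday ."
model_a = [(1, 2, "went")]
model_b = [(1, 2, "went"), (4, 5, "")]
model_c = [(1, 2, "goes")]
print(majority_vote([model_a, model_b, model_c]))  # [(1, 2, 'went')]
```

Only the edit shared by two of the three models survives the vote; disagreements between models are dropped rather than arbitrated.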

Cited by 1 publication (1 citation statement)
References 14 publications
“…NMT approaches, which achieve state-of-the-art results, are encoder-decoder methods in which the encoder and decoder can have different architectures, such as RNNs, CNNs (Gehring et al., 2016), or Transformers (Vaswani et al., 2017), all of which have been applied successfully to the GEC task (Yuan and Briscoe, 2016; Yuan et al., 2019; Junczys-Dowmunt et al., 2018). Recent approaches achieve state-of-the-art results by only fine-tuning pre-trained large language models (Rothe et al., 2021; Tarnavskyi et al., 2022), easing the data requirements of large networks.…”
Section: Approaches (mentioning)
confidence: 99%