Proceedings of the Second Conference on Machine Translation 2017
DOI: 10.18653/v1/w17-4704
Modeling Target-Side Inflection in Neural Machine Translation

Abstract: NMT systems have problems with large vocabulary sizes. Byte-pair encoding (BPE) is a popular approach to solving this problem, but while BPE allows the system to generate any target-side word, it does not enable effective generalization over the rich vocabulary in morphologically rich languages with strong inflectional phenomena. We introduce a simple approach to overcome this problem by training a system to produce the lemma of a word and its morphologically rich POS tag, which is then followed by a determinis…
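The abstract describes training the NMT system to output, for each target word, a lemma plus a rich morphological tag, with the surface form restored afterwards. The following is a minimal Python sketch of such a preprocessing step, assuming an external morphological analyzer is available; the `analyze` callback, the tag strings, and the toy analysis are illustrative assumptions, not the paper's actual tools or tagset.

```python
def to_lemma_tag_sequence(words, analyze):
    """Interleave lemmas and morphological tags, one tag per lemma."""
    output = []
    for word in words:
        lemma, tag = analyze(word)  # e.g. ("kočka", "NNFP1-----A----")
        output.append(lemma)
        output.append(tag)
    return output

# Toy analysis standing in for a real morphological analyzer.
fake_analysis = {"kočky": ("kočka", "NNFP1-----A----")}
print(to_lemma_tag_sequence(["kočky"], fake_analysis.get))
# ['kočka', 'NNFP1-----A----']
```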

Cited by 53 publications (29 citation statements) | References 10 publications
“…Morphological generation of previously unencountered word forms is a crucial problem in many areas of natural language processing (NLP). High performance can lead to better systems for downstream tasks, e.g., machine translation (Tamchyna et al., 2017). Since existing lexicons have limited coverage, learning morphological inflection patterns from labeled data is an important mission and has recently been the subject of multiple shared tasks (Cotterell et al., 2016, 2017a).…”
Section: Introduction
confidence: 99%
“…The output of the model is converted into surface forms in a separate, deterministic post-processing step. A similar two-step approach has been found to improve English to Czech NMT (Tamchyna et al., 2017), probably due to alleviating data sparsity caused by morphological complexity. As Finnish is also a morphologically complex language, adapting this approach to Finnish should result in a similar improvement.…”
Section: NMT with Morphological Analysis and Generation
confidence: 93%
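The statement above refers to a deterministic post-processing step that turns predicted lemma/tag pairs back into surface forms. Below is a minimal sketch of that idea under the assumption of a lookup-style morphological generator; the generation table and the fall-back-to-lemma behaviour are assumptions for illustration, not the cited systems' actual implementation.

```python
def lemmas_tags_to_surface(tokens, generate):
    """tokens = [lemma1, tag1, lemma2, tag2, ...] -> surface word forms."""
    surface = []
    for lemma, tag in zip(tokens[0::2], tokens[1::2]):
        surface.append(generate(lemma, tag) or lemma)  # fall back to the bare lemma
    return surface

# Toy generation table standing in for a real morphological generator.
table = {("kočka", "NNFP1-----A----"): "kočky"}
print(lemmas_tags_to_surface(["kočka", "NNFP1-----A----"],
                             lambda l, t: table.get((l, t))))
# ['kočky']
```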
“…The annotation format used differs from the one in Tamchyna et al. (2017) in several aspects, the most important of which is that the morphological tags are not complex, multicategory tags that are interleaved one-to-one with lemmas. Instead, each lemma token can be followed by zero or more morphological tags, each corresponding to a nondefault value in a single morphological category: the first lemma komissio is the only one without any morphological tags; the rest of the lemmas are trailed by one or more tags.…”
Section: NMT with Morphological Analysis and Generation
confidence: 99%
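The format described above attaches zero or more single-category tags after each lemma rather than one complex tag per lemma. A small sketch of how such a flat token stream could be grouped back into lemma/tag pairs is given below; the tag spelling ("+PRES", "+SG3") and the is-a-tag test are invented for illustration, only the lemma komissio comes from the quoted example.

```python
def group_lemmas_with_tags(tokens, is_tag):
    """Group a flat token stream into (lemma, [tags]) pairs."""
    groups = []
    for token in tokens:
        if is_tag(token):
            groups[-1][1].append(token)  # attach the tag to the preceding lemma
        else:
            groups.append((token, []))   # start a new lemma with no tags yet
    return groups

# "komissio" carries no tags; the invented verb tags illustrate the trailing format.
stream = ["komissio", "ehdottaa", "+PRES", "+SG3"]
print(group_lemmas_with_tags(stream, lambda t: t.startswith("+")))
# [('komissio', []), ('ehdottaa', ['+PRES', '+SG3'])]
```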
“…Moreover, the generated morphological tags following slot placeholders can be used to limit the scope of possible surface forms during lexicalization (see Section 3.3). This approach is inspired by similar approaches in phrase-based MT (Bojar, 2007; Toutanova et al., 2008; Fraser, 2009) and was developed in parallel to recent similar experiments with two-step neural MT (Nadejde et al., 2017; Tamchyna et al., 2017). We compare the lemma-tag generation mode against the TGen default direct word-form generation mode in our experiments.…”
Section: Lemma-Tag Generation Mode
confidence: 99%
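The statement above mentions using the tags generated after slot placeholders to narrow the set of admissible surface forms during lexicalization. A tiny sketch of that filtering idea follows, assuming a precomputed map from slot values to tagged candidate forms; the candidate table, the tag labels, and the fallback to the raw value are assumptions, not the cited paper's procedure.

```python
def lexicalize(slot_value, emitted_tag, candidates):
    """Pick the surface form of a slot value that matches the emitted tag."""
    forms = candidates.get(slot_value, {})
    return forms.get(emitted_tag, slot_value)  # fall back to the raw value

# Invented candidate forms for a Czech place name.
candidates = {"Praha": {"+GEN": "Prahy", "+LOC": "Praze"}}
print(lexicalize("Praha", "+LOC", candidates))
# Praze
```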