2021
DOI: 10.48550/arxiv.2110.08191
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Why don't people use character-level machine translation?

Abstract: We present a literature and empirical survey that critically assesses the state of the art in character-level modeling for machine translation (MT). Despite evidence in the literature that character-level systems are comparable with subword systems, they are virtually never used in competitive setups in WMT competitions. We empirically show that even with recent modeling innovations in characterlevel natural language processing, characterlevel MT systems still struggle to match their subword-based counterparts… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
7
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(7 citation statements)
references
References 42 publications
0
7
0
Order By: Relevance
“…Indeed, the curves for both BLEU (1a) and gender coverage (1b) have a rapid and steady initial increase, 11 which starts to level off around the 20th ckp. 12 Also, the BLEU trends reveal a divide across models (BPE>CHAR) that remains visible over the 9 Contemporary to our submission, Libovickỳ et al (2021) show that en-de MT systems based on character-level segmentation have an edge -with respect to BPE -in terms of gender accuracy on the WinoMT benchmark . Their results, however, do not distinguish between feminine and masculine translation capabilities.…”
Section: Overall Resultsmentioning
confidence: 68%
“…Indeed, the curves for both BLEU (1a) and gender coverage (1b) have a rapid and steady initial increase, 11 which starts to level off around the 20th ckp. 12 Also, the BLEU trends reveal a divide across models (BPE>CHAR) that remains visible over the 9 Contemporary to our submission, Libovickỳ et al (2021) show that en-de MT systems based on character-level segmentation have an edge -with respect to BPE -in terms of gender accuracy on the WinoMT benchmark . Their results, however, do not distinguish between feminine and masculine translation capabilities.…”
Section: Overall Resultsmentioning
confidence: 68%
“…Contemporary to our submission,Libovickỳ et al (2021) show that en-de MT systems based on character-level segmentation have an edge -with respect to BPE -in terms of gender accuracy on the WinoMT benchmark(Stanovsky et al, 2019). Their results, however, do not distinguish between feminine and masculine translation capabilities.10 For the sake of our analysis across epochs, we do not generate our final systems by averaging the 5 models around the best ckp as in andSavoldi et al (2022).…”
mentioning
confidence: 86%
“…We now explain the architecture used in our experiments. We build off of the previous work by using the CNN downsampling architecture followed by the Transformer and using Libovickỳ et al (2021)'s two-step decoding with an LSTM for upsampling. This previous work was only applied to fixed-length downsampling and upsampling, however the aforementioned WDD and SDD methods require variable-length downsampling and upsampling.…”
Section: Architecturementioning
confidence: 99%
“…To alleviate the problem of training time, several methods have been proposed to initially downsample characters into shorter sequences, which are then fed into the encoder or decoder. For discriminative tasks, these can be applied without any loss in performance (Tay et al, 2021), however for generative tasks like NMT, the performance is either untested or lacking when compared to character models without downsampling (Libovickỳ et al, 2021).…”
Section: Introductionmentioning
confidence: 99%