Larger n-gram language models (LMs) perform better in statistical machine translation (SMT). However, existing approaches to constructing larger LMs have two main drawbacks: 1) it is not easy to obtain large corpora in the same domain as the bilingual parallel corpora used in SMT; and 2) most previous studies exploit only monolingual information from the target-side corpora, so redundant n-grams have not been fully utilized in SMT. Recently, continuous-space language models (CSLMs), especially neural network language models (NNLMs), have been shown to greatly improve the accuracy of estimating the probabilities of target words. However, most of these CSLM and NNLM approaches still consider only monolingual information or require additional corpora. In this paper, we propose a novel neural-network-based bilingual LM growing method. Compared with existing approaches, the proposed method enables us to exploit the bilingual parallel corpus itself for LM growing in SMT. Experimental results show that our method significantly outperforms the existing approaches in both SMT performance and computational efficiency.
Index Terms-Continuous-space language model, language model growing (LMG), neural network language model, statistical machine translation (SMT).

1 It is common to use a larger monolingual corpus in SMT, in comparison to the small bilingual parallel corpus.
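As standard background for the terminology above (textbook material, not a contribution of this paper), an n-gram LM approximates the probability of a target sentence under an (n-1)-order Markov assumption:

$$
P(w_1, \ldots, w_T) \;\approx\; \prod_{t=1}^{T} P\left(w_t \mid w_{t-n+1}, \ldots, w_{t-1}\right).
$$

A CSLM or NNLM estimates each conditional probability with a neural network over continuous word representations rather than with discrete count-based smoothing, which is the source of the accuracy gains referred to in the abstract.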