Pushdown Automata in Statistical Machine Translation

Allauzen, Cyril; Byrne, Bill; Gispert, Adrià de; Iglesias, Gonzalo; Riley, Michael

doi:10.1162/coli_a_00197

Cited by 15 publications

(8 citation statements)

References 31 publications


“…A target language sentence y can be a translation of a source language sentence x if there is a derivation D in the grammar which yields both y and x: y = y(D), x = x(D). This defines a regular language Y over strings in the target language via a projection of the sentence to be translated: Y = {y(D) : x(D) = x} (Iglesias et al., 2011; Allauzen et al., 2014). Scores are defined over derivations via a log-linear model with features {φ_i} and weights λ.…”

Section: Introduction (mentioning)

confidence: 99%

Syntactically Guided Neural Machine Translation

Stahlberg, Hasler, Waite, et al. 2016

Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)

Self Cite

We investigate the use of hierarchical phrase-based SMT lattices in end-to-end neural machine translation (NMT). Weight pushing transforms the Hiero scores for complete translation hypotheses, with the full translation grammar score and full n-gram language model score, into posteriors compatible with NMT predictive probabilities. With a slightly modified NMT beam-search decoder we find gains over both Hiero and NMT decoding alone, with practical advantages in extending NMT to very large input and output vocabularies.

“…Lexicographic semirings, used for PoS tagging disambiguation, have been also shown to be useful in other tasks (Sproat et al., 2014), such as optimized epsilon encoding for backoff language models, and hierarchical phrase-based decoding with Pushdown Automata (Allauzen et al., 2014).…”

Section: Conclusion and Related Work (mentioning)

confidence: 99%

Transducer Disambiguation with Sparse Topological Features

Iglesias, Gispert, Byrne 2015

Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing

Self Cite

We describe a simple and efficient algorithm to disambiguate non-functional weighted finite state transducers (WFSTs), i.e. to generate a new WFST that contains a unique, best-scoring path for each hypothesis in the input labels along with the best output labels. The algorithm uses topological features combined with a tropical sparse tuple vector semiring. We empirically show that our algorithm is more efficient than previous work in a PoS-tagging disambiguation task. We use our method to rescore very large translation lattices with a bilingual neural network language model, obtaining gains in line with the literature.

“…The strict separation of scoring module and search strategy and the decoupling of scoring modules from each other makes SGNMT a very flexible decoding tool for neural and symbolic models which is applicable not only to machine translation. SGNMT is based on the OpenFST-based Cambridge SMT system (Allauzen et al., 2014). Although the system is less than a year old, we have found it to be very flexible and easy for new researchers to adopt.…”

Section: Introduction (mentioning)

confidence: 99%

SGNMT – A Flexible NMT Decoding Platform for Quick Prototyping of New Models and Search Strategies

Stahlberg, Hasler, Saunders, et al. 2017

Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing: System Demonstrations

Self Cite

This paper introduces SGNMT, our experimental platform for machine translation research. SGNMT provides a generic interface to neural and symbolic scoring modules (predictors) with left-to-right semantics, such as translation models like NMT, language models, translation lattices, n-best lists, or other kinds of scores and constraints. Predictors can be combined with other predictors to form complex decoding tasks. SGNMT implements a number of search strategies for traversing the space spanned by the predictors, which are appropriate for different predictor constellations. Adding new predictors or decoding strategies is particularly easy, making it a very efficient tool for prototyping new research ideas. SGNMT is actively being used by students in the MPhil program in Machine Learning, Speech and Language Technology at the University of Cambridge for course work and theses, as well as for most of the research work in our group.
