2014
DOI: 10.1162/coli_a_00197
|View full text |Cite
|
Sign up to set email alerts
|

Pushdown Automata in Statistical Machine Translation

Abstract: This article describes the use of pushdown automata (PDA) in the context of statistical machine translation and alignment under a synchronous context-free grammar. We use PDAs to compactly represent the space of candidate translations generated by the grammar when applied to an input sentence. General-purpose PDA algorithms for replacement, composition, shortest path, and expansion are presented. We describe HiPDT, a hierarchical phrase-based decoder using the PDA representation and these algorithms. We contr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
8
0

Year Published

2015
2015
2021
2021

Publication Types

Select...
5
4

Relationship

3
6

Authors

Journals

citations
Cited by 15 publications
(8 citation statements)
references
References 31 publications
0
8
0
Order By: Relevance
“…A target language sentence y can be a translation of a source language sentence x if there is a derivation D in the grammar which yields both y and x: y = y(D), x = x(D). This defines a regular language Y over strings in the target language via a projection of the sentence to be translated: Y = {y(D) : x(D) = x} (Iglesias et al, 2011;Allauzen et al, 2014). Scores are defined over derivations via a log-linear model with features {φ i } and weights λ.…”
Section: Introductionmentioning
confidence: 99%
“…A target language sentence y can be a translation of a source language sentence x if there is a derivation D in the grammar which yields both y and x: y = y(D), x = x(D). This defines a regular language Y over strings in the target language via a projection of the sentence to be translated: Y = {y(D) : x(D) = x} (Iglesias et al, 2011;Allauzen et al, 2014). Scores are defined over derivations via a log-linear model with features {φ i } and weights λ.…”
Section: Introductionmentioning
confidence: 99%
“…Lexicographic semirings, used for PoS tagging disambiguation , have been also shown to be useful in other tasks (Sproat et al, 2014), such as optimized epsilon encoding for backoff language models , and hierarchical phrase-based decoding with Pushdown Automata (Allauzen et al, 2014).…”
Section: Conclusion and Related Workmentioning
confidence: 99%
“…SGNMT is based on the OpenFSTbased Cambridge SMT system (Allauzen et al, 2014). Although the system is less than a year old, we have found it to be very flexible and easy for new researchers to adopt.…”
Section: Introductionmentioning
confidence: 99%
“…The strict separation of scoring module and search strategy and the decoupling of scoring modules from each other makes SGNMT a very flexible decoding tool for neural and symbolic models which is applicable not only to machine translation. SGNMT is based on the OpenFSTbased Cambridge SMT system (Allauzen et al, 2014). Although the system is less than a year old, we have found it to be very flexible and easy for new researchers to adopt.…”
Section: Introductionmentioning
confidence: 99%