2012
DOI: 10.1007/s10590-012-9132-2
|View full text |Cite
|
Sign up to set email alerts
|

N-gram posterior probability confidence measures for statistical machine translation: an empirical study

Abstract: We report an empirical study of n-gram posterior probability confidence measures for statistical machine translation (SMT). We first describe an efficient and practical algorithm for rapidly computing n-gram posterior probabilities from large translation word lattices. These probabilities are shown to be a good predictor of whether or not the n-gram is found in human reference translations, motivating their use as a confidence measure for SMT. Comprehensive n-gram precision and word coverage measurements are p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
21
0

Year Published

2013
2013
2018
2018

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(21 citation statements)
references
References 26 publications
0
21
0
Order By: Relevance
“…Figure 2a shows that up to k = 10, 000 higher value of k to extract the rewriting phrase table increase the BLEU score on the test set. 12 We did not experiment with higher values of k, but plan to use the output lattice produced by 1-pass Moses to compute efficiently posteriors for larger sets of bi-phrases (de Gispert et al, 2013).…”
Section: Rewriter Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…Figure 2a shows that up to k = 10, 000 higher value of k to extract the rewriting phrase table increase the BLEU score on the test set. 12 We did not experiment with higher values of k, but plan to use the output lattice produced by 1-pass Moses to compute efficiently posteriors for larger sets of bi-phrases (de Gispert et al, 2013).…”
Section: Rewriter Resultsmentioning
confidence: 99%
“…• phrase-based confidence score : bi-phrases are associated to a posterior probability, inspired from n-gram posterior probability estimation as defined in (de Gispert et al, 2013). Let E be the set of all hypotheses in the space of translation hypotheses defined by the n-best list used for source sentence f , and E α be the subset of E such that word alignments in sentence pairs (e , f ), ∀e ∈ E α , allow us to extract bi-phrase α.…”
Section: Reranking and Featuresmentioning
confidence: 99%
See 1 more Smart Citation
“…Each arc labelled u receives a score equal to the posterior unigram probability P (u|ε) of the system generating u at this position. P (u|ε) is computed as in (de Gispert et al, 2013):…”
Section: Position Imentioning
confidence: 99%
“…The lattices representing the search space considered to generate these pseudo-references also allow us to estimate the posterior probability of a target word that quantifies the probability that it is part of the system output (Gispert et al, 2013). Posteriors aggregate two pieces of information for each word in the final hypothesis: first, all the paths in the lattice (i.e.…”
Section: World-level Quality Estimationmentioning
confidence: 99%