N-gram posterior probability confidence measures for statistical machine translation: an empirical study

Gispert, Adrià de; Blackwood, Graeme; Iglesias, Gonzalo; Byrne, William

doi:10.1007/s10590-012-9132-2

Cited by 13 publications

(21 citation statements)

References 26 publications


“…Figure 2a shows that, up to k = 10,000, higher values of k used to extract the rewriting phrase table increase the BLEU score on the test set. We did not experiment with higher values of k, but plan to use the output lattice produced by 1-pass Moses to efficiently compute posteriors for larger sets of bi-phrases (de Gispert et al, 2013).…”

Section: Rewriter Results (mentioning)

confidence: 99%

“…• phrase-based confidence score: bi-phrases are associated with a posterior probability, inspired by n-gram posterior probability estimation as defined in (de Gispert et al, 2013). Let E be the set of all hypotheses in the space of translation hypotheses defined by the n-best list used for source sentence f, and E_α be the subset of E such that word alignments in sentence pairs (e, f), ∀e ∈ E_α, allow us to extract bi-phrase α.…”

Section: Reranking and Features (mentioning)

confidence: 99%
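
The excerpt above stops before giving the formula itself. One plausible form of such a bi-phrase posterior, offered only as a sketch, sums the normalised weight of every hypothesis in E_α. The model score H(e, f) and the exponential weighting are assumptions introduced here for illustration; they are not stated in the quoted text.

```latex
% Sketch under stated assumptions: H(e, f) is a hypothetical model score
% for hypothesis e given source f; E and E_\alpha are as defined in the excerpt.
P(\alpha \mid f) \;=\;
  \frac{\sum_{e \in E_\alpha} \exp\bigl(H(e, f)\bigr)}
       {\sum_{e' \in E} \exp\bigl(H(e', f)\bigr)}
```

Under this reading, a bi-phrase that can be extracted from every hypothesis in the n-best list has posterior 1, and one supported only by low-scoring hypotheses has a posterior close to 0.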

“…Bach et al (2011) worked on the issue of predicting sentence-level and word-level MT errors by using WPP and other features derived from the source context, the source-target alignment, and dependency structures, but relied on a significantly large manually annotated corpus of MT errors. De Gispert et al (2013) calculated k-gram posterior probabilities from n-best lists or word lattices, and demonstrated that they were reasonably accurate indications of whether specific k-grams would be found or not in human reference translations. Finally, the work of Blackwood et al (2010) proposed to segment translation lattices according to confidence measures over the maximum likelihood translation hypothesis to focus on regions with potential translation errors.…”

Section: Confidence Estimation Of Machine Translation (mentioning)

confidence: 99%
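
As an illustration of computing k-gram posteriors from an n-best list, here is a minimal Python sketch. It assumes each hypothesis comes with a log-domain model score and that hypothesis probabilities are obtained by a scaled softmax over the list; the function name `ngram_posteriors`, the `scale` parameter, and the toy data are hypothetical and are not taken from the cited papers.

```python
import math
from collections import defaultdict


def ngram_posteriors(nbest, order=2, scale=1.0):
    """Estimate n-gram posteriors from an n-best list.

    nbest: list of (tokens, log_score) pairs (assumed input format).
    Returns a dict mapping each n-gram tuple to the total normalised
    weight of the hypotheses that contain it at least once.
    """
    # Softmax over scaled log scores, shifted by the max for stability.
    max_s = max(scale * s for _, s in nbest)
    weights = [math.exp(scale * s - max_s) for _, s in nbest]
    z = sum(weights)

    post = defaultdict(float)
    for (tokens, _), w in zip(nbest, weights):
        # A set, so each hypothesis contributes once per distinct n-gram.
        seen = {tuple(tokens[i:i + order])
                for i in range(len(tokens) - order + 1)}
        for gram in seen:
            post[gram] += w / z
    return dict(post)


# Toy 3-best list with made-up probabilities 0.5, 0.3 and 0.2.
toy_nbest = [
    ("the cat sat".split(), math.log(0.5)),
    ("the cat sits".split(), math.log(0.3)),
    ("a cat sat".split(), math.log(0.2)),
]
bigram_post = ngram_posteriors(toy_nbest, order=2)
```

In this toy list, a bigram shared by the two best hypotheses accumulates their combined weight, while a bigram seen only in the last hypothesis gets that hypothesis's weight alone.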


Confidence-based Rewriting of Machine Translation Output

Marie

Max

2014

Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)


Numerous works in Statistical Machine Translation (SMT) have attempted to identify better translation hypotheses obtained by an initial decoding using an improved, but more costly scoring function. In this work, we introduce an approach that takes the hypotheses produced by a state-of-the-art, reranked phrase-based SMT system, and explores new parts of the search space by applying rewriting rules selected on the basis of posterior phrase-level confidence. In the medical domain, we obtain a 1.9 BLEU improvement over a reranked baseline exploiting the same scoring function, corresponding to a 5.4 BLEU improvement over the original Moses baseline. We show that if an indication of which phrases require rewriting is provided, our automatic rewriting procedure yields an additional improvement of 1.5 BLEU. Various analyses, including a manual error analysis, further illustrate the good performance and potential for improvement of our approach in spite of its simplicity.



“…Each arc labelled u receives a score equal to the posterior unigram probability P(u|ε) of the system generating u at this position. P(u|ε) is computed as in (de Gispert et al, 2013):…”

Section: Position I (mentioning)

confidence: 99%

LIMSI's Contribution to the WMT'16 Biomedical Translation Task

Ive

Max

Yvon

2016

Proceedings of the First Conference on Machine Translation: Volume 2, Shared Task Papers


The article describes LIMSI's submission to the first WMT'16 shared biomedical translation task, focusing on the sole English-French translation direction. Our main submission is the output of a MOSES-based statistical machine translation (SMT) system, rescored with Structured OUtput Layer (SOUL) neural network models. We also present an attempt to circumvent syntactic complexity: our proposal combines the outputs of PB-SMT systems trained either to translate entire source sentences or specific syntactic constructs extracted from those sentences. The approach is implemented using Confusion Network (CN) decoding. The quality of the combined output is comparable to the quality of our main system.


“…The lattices representing the search space considered to generate these pseudo-references also allow us to estimate the posterior probability of a target word that quantifies the probability that it is part of the system output (Gispert et al, 2013). Posteriors aggregate two pieces of information for each word in the final hypothesis: first, all the paths in the lattice (i.e.…”

Section: World-level Quality Estimation (mentioning)

confidence: 99%
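
To make the idea of aggregating over all lattice paths concrete, the following Python sketch computes arc posteriors on a small word lattice with a forward-backward pass. It is a simplified illustration, not the cited method: the lattice format, the function name `arc_posteriors`, and the toy weights are assumptions, and states are assumed to be integers numbered in topological order (every arc goes from a lower to a higher state).

```python
from collections import defaultdict


def arc_posteriors(arcs, start, final):
    """Posterior of each arc in an acyclic lattice.

    arcs: list of (src, dst, word, weight), with non-negative weights
    and src < dst for every arc (assumed topological numbering).
    Returns {(src, dst, word): posterior}.
    """
    # Forward pass: alpha[s] = total weight of paths from start to s.
    alpha = defaultdict(float)
    alpha[start] = 1.0
    for src, dst, _, w in sorted(arcs, key=lambda a: a[0]):
        alpha[dst] += alpha[src] * w

    # Backward pass: beta[s] = total weight of paths from s to final.
    beta = defaultdict(float)
    beta[final] = 1.0
    for src, dst, _, w in sorted(arcs, key=lambda a: a[0], reverse=True):
        beta[src] += w * beta[dst]

    z = alpha[final]  # total weight of all complete paths
    return {(src, dst, word): alpha[src] * w * beta[dst] / z
            for src, dst, word, w in arcs}


# Toy lattice with made-up, unnormalised arc weights.
toy_arcs = [
    (0, 1, "the", 3.0),
    (0, 1, "a", 1.0),
    (1, 2, "cat", 2.0),
    (2, 3, "sat", 1.0),
    (2, 3, "sits", 1.0),
]
toy_post = arc_posteriors(toy_arcs, start=0, final=3)
```

An arc's posterior is the share of total path weight that passes through it. Summing the posteriors of all arcs carrying the same word gives that word's expected count, which matches the probability of the word appearing in the output only when no path contains the word more than once.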

Proceedings of the Ninth Workshop on Statistical Machine Translation

Bojar

Buck

Federmann

et al. 2014


The focus of our workshop was to use parallel corpora for machine translation. Recent experimentation has shown that the performance of SMT systems varies greatly with the source language. In this workshop we encouraged researchers to investigate ways to improve the performance of SMT systems for diverse languages, including morphologically more complex languages, languages with partial free word order, and low-resource languages.

Prior to the workshop, in addition to soliciting relevant papers for review and possible presentation, we conducted four shared tasks: a general translation task, a medical translation task, a quality estimation task, and a task to test automatic evaluation metrics. The medical translation task was introduced this year to address the important issue of domain adaptation within SMT. The results of the shared tasks were announced at the workshop, and these proceedings also include an overview paper for the shared tasks that summarizes the results, as well as provides information about the data used and any procedures that were followed in conducting or scoring the task. In addition, there are short papers from each participating team that describe their underlying system in greater detail.

As in previous years, we have received a far larger number of submissions than we could accept for presentation. This year we have received 27 full paper submissions and 49 shared task submissions. In total, WMT 2014 featured 12 full paper oral presentations and 49 shared task poster presentations.

The invited talk was given by Alon Lavie (Carnegie Mellon University and Safaba Translation Solutions, Inc.), entitled "Machine Translation in Academia and in the Commercial World - a Contrastive Perspective".

We would like to thank the members of the Program Committee for their timely reviews. We also would like to thank the participants of the shared task and all the other volunteers who helped with the evaluations.

… is a ranking of the systems that participated in its shared translation tasks, produced by aggregating pairwise sentence-level comparisons collected from human judges. Over the past few years, there have been a number of tweaks to the aggregation formula in attempts to address issues arising from the inherent ambiguity and subjectivity of the task, as well as weaknesses in the proposed models and the manner of model selection.

We continue this line of work by adapting the TrueSkill™ algorithm, an online approach for modeling the relative skills of players in ongoing competitions such as Microsoft's Xbox Live, to the human evaluation of machine translation output. Our experimental results show that TrueSkill outperforms other recently proposed models on accuracy, and also can significantly reduce the number of pairwise annotations that need to be collected by sampling non-uniformly from the space of system competitions.

Introduction: The Workshop on Statistical Machine Translation (WMT) has long been a central event in the machine translation (MT) community for the evaluation of MT output. It hosts...

