Proceedings of the Ninth Workshop on Statistical Machine Translation 2014
DOI: 10.3115/v1/w14-3346
|View full text |Cite
|
Sign up to set email alerts
|

A Systematic Comparison of Smoothing Techniques for Sentence-Level BLEU

Abstract: BLEU is the de facto standard machine translation (MT) evaluation metric. However, because BLEU computes a geometric mean of n-gram precisions, it often correlates poorly with human judgment on the sentence-level.Therefore, several smoothing techniques have been proposed. This paper systematically compares 7 smoothing techniques for sentence-level BLEU. Three of them are first proposed in this paper, and they correlate better with human judgments on the sentence-level than other smoothing techniques. Moreover,… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
106
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 166 publications
(112 citation statements)
references
References 5 publications
0
106
0
Order By: Relevance
“…Fig. 7 shows the evaluation results for Meteor and BLEU with smoothing techniques proposed by Chen and Cherry [5]. These results also report the score as 1.0 for the exact similar answer sentences as noticed in Fig.…”
Section: Automatic Evaluationmentioning
confidence: 58%
See 4 more Smart Citations
“…Fig. 7 shows the evaluation results for Meteor and BLEU with smoothing techniques proposed by Chen and Cherry [5]. These results also report the score as 1.0 for the exact similar answer sentences as noticed in Fig.…”
Section: Automatic Evaluationmentioning
confidence: 58%
“…In general the rest of the smoothing techniques correlate with others in an acceptable level, for instance, Table 9 A brief introduction to smoothing algorithms used with BLEU. The last three smoothing techniques are first proposed by Chen and Cherry [5]. These new ones are mostly formed by modifying the traditional ones shown in S1 to S4.…”
Section: Automatic Evaluationmentioning
confidence: 99%
See 3 more Smart Citations