Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers) 2018
DOI: 10.18653/v1/p18-2054
|View full text |Cite
|
Sign up to set email alerts
|

Improving Beam Search by Removing Monotonic Constraint for Neural Machine Translation

Abstract: To achieve high translation performance, neural machine translation models usually rely on the beam search algorithm for decoding sentences. The beam search finds good candidate translations by considering multiple hypotheses of translations simultaneously. However, as the algorithm searches in a monotonic left-to-right order, a hypothesis can not be revisited once it is discarded. We found such monotonicity forces the algorithm to sacrifice some decoding paths to explore new paths. As a result, the overall qu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 11 publications
(7 citation statements)
references
References 8 publications
0
7
0
Order By: Relevance
“…This corresponds to searching with a non-admissible heuristic (Hart et al, 1968), that is, a heuristic that may underestimate the likelihood of completing a translation. This biased search affects statistics of beam search outputs in unknown ways and may well account for some of the pathologies of Section 2, and has motivated variants of the algorithm aimed at comparing partial translations more fairly (Huang et al, 2017;Shu and Nakayama, 2018). This problem has also been studied in parsing literature, where it's known as imbalanced probability search bias (Stanojević and Steedman, 2020).…”
Section: Nmt and Its Many Biasesmentioning
confidence: 99%
“…This corresponds to searching with a non-admissible heuristic (Hart et al, 1968), that is, a heuristic that may underestimate the likelihood of completing a translation. This biased search affects statistics of beam search outputs in unknown ways and may well account for some of the pathologies of Section 2, and has motivated variants of the algorithm aimed at comparing partial translations more fairly (Huang et al, 2017;Shu and Nakayama, 2018). This problem has also been studied in parsing literature, where it's known as imbalanced probability search bias (Stanojević and Steedman, 2020).…”
Section: Nmt and Its Many Biasesmentioning
confidence: 99%
“…Previous non-monotonic methods (Serdyuk et al, 2018;Zhang et al, 2018;Zhou et al, 2019a,b;Zhang et al, 2019;Welleck et al, 2019) jointly leverage L2R and R2L information. Non-monotonic methods are also widely used in many tasks (Huang et al, 2018;Shu and Nakayama, 2018), such as parsing (Goldberg and Elhadad, 2010), image caption (Mehri and Sigal, 2018), and dependency parsing (Kiperwasser and Goldberg, 2016;. Similarly, insertion-based method (Gu et al, 2019;Stern et al, 2019) predicts the next token and its position to be inserted.…”
Section: Related Workmentioning
confidence: 99%
“…Naive beam search with log-probabilities has several known drawbacks, for example, favoring short translations and its monotonic constraint. Hence, many regularization/rescoring methods [2,27,8,28,16] or beam search variants [6,21] are proposed to improve the performance of beam search. Other than beam search, one promising MAP decoding technique for evaluation is the DFS-based exact search [22], which is designed to find the mode of model distributions.…”
Section: Related Workmentioning
confidence: 99%