2020
DOI: 10.1162/tacl_a_00346
|View full text |Cite
|
Sign up to set email alerts
|

Best-First Beam Search

Abstract: Decoding for many NLP tasks requires an effective heuristic algorithm for approximating exact search because the problem of searching the full output space is often intractable, or impractical in many settings. The default algorithm for this job is beam search—a pruned version of breadth-first search. Quite surprisingly, beam search often returns better results than exact inference due to beneficial search bias for NLP tasks. In this work, we show that the standard implementation of beam search can be made up … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
35
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
3

Relationship

1
9

Authors

Journals

citations
Cited by 49 publications
(35 citation statements)
references
References 27 publications
0
35
0
Order By: Relevance
“…We hope insights derived from our study stimulate research on tighter integrations between the benefits of cascaded and end-to-end sequence models. Exploiting searchable intermediates through beam search is just the tip of the iceberg for search algorithms, as numerous approximate search techniques like diverse beam search (Vijayakumar et al, 2018) and best-first beam search (Meister et al, 2020) have been recently proposed to improve diversity and approximation of the most-likely sequence. Incorporating differentiable lattice based search (Hannun et al, 2020) can also allow the subsequent sub-net to digest n-best representations.…”
Section: Discussionmentioning
confidence: 99%
“…We hope insights derived from our study stimulate research on tighter integrations between the benefits of cascaded and end-to-end sequence models. Exploiting searchable intermediates through beam search is just the tip of the iceberg for search algorithms, as numerous approximate search techniques like diverse beam search (Vijayakumar et al, 2018) and best-first beam search (Meister et al, 2020) have been recently proposed to improve diversity and approximation of the most-likely sequence. Incorporating differentiable lattice based search (Hannun et al, 2020) can also allow the subsequent sub-net to digest n-best representations.…”
Section: Discussionmentioning
confidence: 99%
“…To perform bidirectional decoding, we input both of the sentence start tokens to provide more contextual information for the decoder. The specific method, based on that in [34], is illustrated in Algorithm 1. The goal is to search for the best and most likely transcription based on the source sequence.…”
Section: Character Decodingmentioning
confidence: 99%
“…Rush et al [37] present a variant of beam search for syntax-and phrase-based MT that comes with guarantees a bound on the possible decoding error and is faster. Meister et al [57] develop a generic reformulation of beam search as agenda-based best-first search. Their implementation is faster than standard implementations and is shown to return the top hypothesis the first time it encounters a complete hypothesis.…”
Section: Algorithmmentioning
confidence: 99%