Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL), Main Conference, 2006
DOI: 10.3115/1220835.1220868
Synchronous binarization for machine translation

Abstract: Systems based on synchronous grammars and tree transducers promise to improve the quality of statistical machine translation output, but are often very computationally intensive. The complexity is exponential in the size of individual grammar rules due to arbitrary re-orderings between the two languages, and rules extracted from parallel corpora can be quite large. We devise a linear-time algorithm for factoring syntactic re-orderings by binarizing synchronous rules when possible and show that the resulting ru…

Cited by 57 publications (76 citation statements)
References 11 publications
“…Zhang et al. (2006) introduced a synchronous binarization technique that improved decoding efficiency and accuracy by ensuring that rule binarization avoided gaps on both the source and target sides (for rules where this was possible). Their binarization was designed to share binarized pieces among rules, but their approach to distributing weight was the default (nondiffused) case found in this paper to be least efficient: the entire weight of the original rule is placed at the top binarized rule and all internal rules are assigned a probability of 1.0.…”
Section: Discussion
confidence: 99%
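The binarization scheme described in the excerpt above can be sketched compactly. The snippet below is a simplified, illustrative reconstruction (the names and data structures are ours, not Zhang et al.'s implementation): a rule's reordering is represented as a permutation giving each source-side nonterminal's target position, and a shift-reduce pass merges adjacent source nonterminals whenever their target spans are also adjacent, producing binary "virtual" rules.

```python
# Illustrative sketch of synchronous binarization, simplified from the
# idea in Zhang et al. (2006); this is an assumption-laden sketch, not
# their actual algorithm or code.  perm[i] is the target position of
# the i-th source-side nonterminal of a rule.

def synchronous_binarize(perm):
    """Shift-reduce binarization.

    Push each nonterminal's target position as a one-element span;
    whenever the top two spans on the stack are adjacent in the target,
    reduce them into a virtual binary rule.  Returns the list of
    reductions as (left_span, right_span, merged_span) triples, or
    None if the permutation admits no synchronous binarization.
    """
    stack = []       # target spans (lo, hi) of merged source pieces
    reductions = []
    for p in perm:
        stack.append((p, p))
        while len(stack) >= 2:
            (a_lo, a_hi), (b_lo, b_hi) = stack[-2], stack[-1]
            if b_lo == a_hi + 1 or a_lo == b_hi + 1:  # target-adjacent
                stack.pop(); stack.pop()
                merged = (min(a_lo, b_lo), max(a_hi, b_hi))
                reductions.append(((a_lo, a_hi), (b_lo, b_hi), merged))
                stack.append(merged)
            else:
                break
    return reductions if len(stack) == 1 else None
```

For example, the swap-of-pairs permutation `[1, 0, 3, 2]` binarizes into three virtual rules, while the classic non-binarizable permutation `[2, 4, 1, 3]` yields `None`. Under the non-diffused weighting the excerpt describes, the original rule's whole weight would sit on the final (top) reduction and every other virtual rule would carry probability 1.0.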
“…Increasing sharing reduces the amount of state that the parser must explore. Binarization has also been investigated in the context of parsing-based approaches to machine translation, where it has been shown that paying careful attention to the binarization scheme can produce much faster decoders (Zhang et al., 2006; Huang, 2007; DeNero et al., 2009).…”
Section: Introduction
confidence: 99%
“…For ease of presentation, and following synchronous-grammar based MT practice, we will henceforth restrict our focus to binary grammars (Zhang et al., 2006; Wang et al., 2007).…”
Section: Undirected Machine Translation
confidence: 99%
“…In Table 2, the dot column stands for artificial anchor points in the SL sentence, Lw and Rw for the previous and successive words of the current one respectively, and P, LHS, Lw, RHS and Rw constitute the syntactic reordering features of our model. Notice that, inspired by [1] and [11], we assume SL parse trees are binarized before being fed into the tree-to-string transformation algorithm. [1] suggests binary-branching ITG rules prune seemingly unlikely and arbitrary word permutations yet, at the same time, accommodate most meaningful structural reversals during translation.…”
Section: Tree-to-string Transformation Algorithm
confidence: 99%
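The last excerpt's claim that binary-branching ITG rules prune unlikely permutations while keeping most meaningful reversals can be checked directly: a permutation is derivable by binary ITG rules exactly when it is separable, i.e. it can be recursively split into two parts whose target images are adjacent contiguous blocks. The self-contained sketch below is our own illustration (not code from any of the cited works):

```python
from itertools import permutations

def itg_derivable(perm):
    """True iff perm can be produced by binary ITG rules: it splits at
    some point into a left and right part, each mapping onto one
    contiguous target block, each itself recursively derivable."""
    n = len(perm)
    if n <= 1:
        return True
    for k in range(1, n):
        left, right = perm[:k], perm[k:]
        # each side must occupy a single contiguous range of target
        # positions (straight or inverted order both allowed)
        if (max(left) - min(left) == k - 1 and
                max(right) - min(right) == n - k - 1 and
                itg_derivable(left) and itg_derivable(right)):
            return True
    return False

# Of the 24 permutations of 4 symbols, 22 are ITG-derivable; only the
# patterns 2413 and 3142 are excluded.
```

This makes the pruning concrete at small rule sizes: for four nonterminals, binary ITG rules exclude only the two "inside-out" permutations, while all 22 remaining reorderings stay reachable.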