2017
DOI: 10.1162/tacl_a_00075
|View full text |Cite
|
Sign up to set email alerts
|

Phrase Table Induction Using In-Domain Monolingual Data for Domain Adaptation in Statistical Machine Translation

Abstract: We present a new framework to induce an in-domain phrase table from in-domain monolingual data that can be used to adapt a general-domain statistical machine translation system to the targeted domain. Our method first compiles sets of phrases in source and target languages separately and generates candidate phrase pairs by taking the Cartesian product of the two phrase sets. It then computes inexpensive features for each candidate phrase pair and filters them using a supervised classifier in order to induce an… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 14 publications
0
1
0
Order By: Relevance
“…Empirical results show that having an in-domain monolingual corpus could substantially improve translation quality, especially with in-domain monolingual data on the target side [111]. There are other ways of adapting translation models with monolingual corpora with different degrees of success [112][113][114][115].…”
Section: Off-line Adaptationmentioning
confidence: 99%
“…Empirical results show that having an in-domain monolingual corpus could substantially improve translation quality, especially with in-domain monolingual data on the target side [111]. There are other ways of adapting translation models with monolingual corpora with different degrees of success [112][113][114][115].…”
Section: Off-line Adaptationmentioning
confidence: 99%