Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing 2015
DOI: 10.18653/v1/d15-1163
|View full text |Cite
|
Sign up to set email alerts
|

Improving Statistical Machine Translation with a Multilingual Paraphrase Database

Abstract: The multilingual Paraphrase Database (PPDB) is a freely available automatically created resource of paraphrases in multiple languages. In statistical machine translation, paraphrases can be used to provide translation for out-of-vocabulary (OOV) phrases. In this paper, we show that a graph propagation approach that uses PPDB paraphrases can be used to improve overall translation quality. We provide an extensive comparison with previous work and show that our PPDB-based method improves the BLEU score by up to 1… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
10
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 13 publications
(10 citation statements)
references
References 37 publications
0
10
0
Order By: Relevance
“…It is a fundamental semantic relation in human language, as formalized in the Meaning-Text linguistic theory which defines meaning as 'invariant of paraphrases' (Milićević, 2006). Researchers have shown benefits of using paraphrases in a wide range of applications (Madnani and Dorr, 2010), including question answering (Fader et al, 2013), semantic parsing (Berant and Liang, 2014), information extraction (Sekine, 2006;Zhang et al, 2015), machine translation (Mehdizadeh Seraj et al, 2015), textual entailment (Dagan et al, 2006;Bjerva et al, 2014;Marelli et al, 2014;Izadinia et al, 2015), vector semantics (Faruqui et al, 2015;Wieting et al, 2015), and semantic textual similarity (Agirre et al, 2015;Li and Srikumar, 2016). Studying paraphrases in Twitter can also help track unfolding events (Vosoughi and Roy, 2016) or the spread of information (Bakshy et al, 2011) on social networks.…”
Section: Introductionmentioning
confidence: 99%
“…It is a fundamental semantic relation in human language, as formalized in the Meaning-Text linguistic theory which defines meaning as 'invariant of paraphrases' (Milićević, 2006). Researchers have shown benefits of using paraphrases in a wide range of applications (Madnani and Dorr, 2010), including question answering (Fader et al, 2013), semantic parsing (Berant and Liang, 2014), information extraction (Sekine, 2006;Zhang et al, 2015), machine translation (Mehdizadeh Seraj et al, 2015), textual entailment (Dagan et al, 2006;Bjerva et al, 2014;Marelli et al, 2014;Izadinia et al, 2015), vector semantics (Faruqui et al, 2015;Wieting et al, 2015), and semantic textual similarity (Agirre et al, 2015;Li and Srikumar, 2016). Studying paraphrases in Twitter can also help track unfolding events (Vosoughi and Roy, 2016) or the spread of information (Bakshy et al, 2011) on social networks.…”
Section: Introductionmentioning
confidence: 99%
“…Multilingual Machine Translation. The multilingual MT enjoys a rich research history, dating back to the age of statistical machine translation (Gao et al, 2002;Haffari and Sarkar, 2009;Seraj et al, 2015). In recent years, the prosperity of neural machine translation (NMT) has led to the growing prominence and popularity of multilingual MT systems.…”
Section: Related Workmentioning
confidence: 99%
“…Multilingual Machine Translation. The multilingual MT enjoy a rich research history, dating back to the age of statistical machine translation (Gao et al, 2002;Haffari and Sarkar, 2009;Seraj et al, 2015). In recent year, the prosperity of neural machine translation (NMT) has led to the growing prominent and popularity of multilingual MT systems, and the encoder-decoder framework has made the de facto standard for NMT (Bahdanau et al, 2015;Vaswani et al, 2017).…”
Section: Related Workmentioning
confidence: 99%