2014
DOI: 10.12988/ces.2014.49152
|View full text |Cite
|
Sign up to set email alerts
|

Bilingual multi-word lexicon construction via a pivot language

Abstract: Bilingual multi-word lexicons are helpful for statistical machine translation systems to improve their performance. In this paper we present a method for constructing such lexicons in a resource-poor language pair such as Korean-French. By using two parallel corpora sharing one pivot language we can easily construct such lexicons without any external language resource like a seed dictionary. The experimental results for the KR to FR have shown that the accuracy is quite promising, even though this research is … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2016
2016
2016
2016

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 5 publications
0
2
0
Order By: Relevance
“…Based on these circumstances, we identified potential MWE candidates for source and target languages by using both linguistic and statistical information [19,20]. The identification method (see [11] for more details) can be described as follows:…”
Section: Mwe Identificationmentioning
confidence: 99%
See 1 more Smart Citation
“…Based on these circumstances, we identified potential MWE candidates for source and target languages by using both linguistic and statistical information [19,20]. The identification method (see [11] for more details) can be described as follows:…”
Section: Mwe Identificationmentioning
confidence: 99%
“…This is one of the disadvantages for extracting bilingual MWEs from parallel corpora. Under the circumstance, Seo et al [11] have proposed a novel method, denoted as the pivot context-based approach for multiwords (PCAM) in the rest of the paper, for extracting bilingual MWEs by using parallel corpora in a resource-poor language pair. However, PCAM has a weak point that targets constituents instead of complete translation equivalents, which might have high scores of similarity if their contexts are not enough.…”
Section: Introductionmentioning
confidence: 99%