COLING-02 on Machine Translation in Asia - 2002
DOI: 10.3115/1118794.1118805
|View full text |Cite
|
Sign up to set email alerts
|

Identifying synonymous expressions from a bilingual corpus for example-based machine translation

Abstract: Example-based machine translation (EBMT) is based on a bilingual corpus. In EBMT, sentences similar to an input sentence are retrieved from a bilingual corpus and then output is generated from translations of similar sentences. Therefore, a similarity measure between the input sentence and each sentence in the bilingual corpus is important for EBMT. If some similar sentences are missed from retrieval, the quality of translations drops. In this paper, we describe a method to acquire synonymous expressions from … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2003
2003
2005
2005

Publication Types

Select...
1
1

Relationship

2
0

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 3 publications
0
2
0
Order By: Relevance
“…Shimohata et al [13] extracted lexical paraphrases based on the substitution operation of edit operations. Text pairs having more than three edit distances are excluded from extraction.…”
Section: Lexical Paraphrase Extraction From Mctmentioning
confidence: 99%
“…Shimohata et al [13] extracted lexical paraphrases based on the substitution operation of edit operations. Text pairs having more than three edit distances are excluded from extraction.…”
Section: Lexical Paraphrase Extraction From Mctmentioning
confidence: 99%
“…(2) Finch et al (2002) cluster sentences in a handcrafted paraphrase corpus (Sugaya et al, 2002) to obtain pairs that are similar to each other for training SMT models, then by using the models the decoder generates a paraphrase. The experimental results indicate that (i) the EBMT based on normalization had increased coverage (Shimohata et al, 2002b) and (ii) the SMT created on the normalized sentences had a reduced word-error-rate (Watanabe et al, 2002a). Imamura et al (2003) proposed a calculation that measures the literalness of a translation pair and called it TCR.…”
Section: Paraphrasing and Filteringmentioning
confidence: 99%