2008
DOI: 10.1007/s10590-008-9048-z
|View full text |Cite
|
Sign up to set email alerts
|

METIS-II: low resource machine translation

Abstract: METIS-II was an EU-FET MT project running from October 2004 to September 2007, which aimed at translating free text input without resorting to parallel corpora. The idea was to use "basic" linguistic tools and representations and to link them with patterns and statistics from the monolingual target-language corpus. The METIS-II project has four partners, translating from their "home" languages Greek, Dutch, German, and Spanish into English. The paper outlines the basic ideas of the project, their implementatio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2010
2010
2017
2017

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(3 citation statements)
references
References 11 publications
(12 reference statements)
0
3
0
Order By: Relevance
“…Other strategies are the crowdsourcing of additional parallel data, or the use of large monolingual and comparable corpora to perform bilingual lexicon induction before training an MT system (Klementiev et al, 2012;Irvine and Callison-Burch, 2013;Irvine and Callison-Burch, 2016). The METIS-II EU project replaced the need for parallel corpora by using linguistic pre-processing and statistics from target-language corpora only (Carl et al, 2008). In a recent study applied to Afrikaans-to-Dutch translation, the authors use a characterbased "cipher model" and a word-based language model to design a decoder for the low-resourced input language (Pourdamghani and Knight, 2017).…”
Section: Monolingual Resourcesmentioning
confidence: 99%
“…Other strategies are the crowdsourcing of additional parallel data, or the use of large monolingual and comparable corpora to perform bilingual lexicon induction before training an MT system (Klementiev et al, 2012;Irvine and Callison-Burch, 2013;Irvine and Callison-Burch, 2016). The METIS-II EU project replaced the need for parallel corpora by using linguistic pre-processing and statistics from target-language corpora only (Carl et al, 2008). In a recent study applied to Afrikaans-to-Dutch translation, the authors use a characterbased "cipher model" and a word-based language model to design a decoder for the low-resourced input language (Pourdamghani and Knight, 2017).…”
Section: Monolingual Resourcesmentioning
confidence: 99%
“…Irvine and Callison-Burch using the combination of parallel and comparable corpus, and gain between 0.5 and 1.7 points of bleu scores improvement in some Hindi local languages into English (Irvine & Callison-Burch, 2013). While Michael Carl shows an improvement by using lemma and PoS Tag in Spanish, Dutch and Greece languages translation into English (Carl, et al, 2008), and Jeff MA modifies word alignment by using clustering and succeed to handle more than 90% of loss resulting from poor word alignment caused by small training corpus in Chinese into English translation (MA, Matsoukas, & Schwartz, 2011). While in this paper, by considering the absence of Sundanese tool and resources to generate PoS, lemma, and other linguistics features, we prefer to use the PoS label tagged manually to get a better translation result in PBMT.…”
Section: Related Workmentioning
confidence: 99%
“…Other pioneer hybridizations include: the combination of translation memories with rule‐based MT (Heyn, ), the combination of example‐based MT and rule‐based MT, and the combination of translation memories and example‐based MT (Carl & Hansen, ). More advanced hybrid techniques include the combination of rule‐based and several types of corpus‐based approaches (Carl et al., ). Finally, most recent works on hybridization tend to involve statistical systems with linguistic knowledge (Costa‐jussà & Farrús, ), which seems to suggest that modern statistical MT may have a high influence on rule‐based MT.…”
Section: Introductionmentioning
confidence: 99%