2008
DOI: 10.1007/s10590-008-9044-3
|View full text |Cite
|
Sign up to set email alerts
|

Using target-language information to train part-of-speech taggers for machine translation

Abstract: Although corpus-based approaches to machine translation (MT) are growing in interest, they are not applicable when the translation involves lessresourced language pairs for which there are no parallel corpora available; in those cases, the rule-based approach is the only applicable solution. Most rule-based MT systems make use of part-of-speech (PoS) taggers to solve the PoS ambiguities in the source-language texts to translate; those MT systems require accurate PoS taggers to produce reliable translations in … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
4
0

Year Published

2009
2009
2024
2024

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 9 publications
(4 citation statements)
references
References 20 publications
0
4
0
Order By: Relevance
“…The development of such transfer rules requires qualified people to code them manually; therefore, their automatic inference may save part of this human effort. The method we present is entirely unsupervised and benefits from information in the rest of modules of the MT system in which the inferred rules are applied, in line with the method proposed by Sánchez-Martínez et al (2008) to train part-of-speech taggers in an unsupervised way for their use in MT.…”
Section: Overviewmentioning
confidence: 99%
“…The development of such transfer rules requires qualified people to code them manually; therefore, their automatic inference may save part of this human effort. The method we present is entirely unsupervised and benefits from information in the rest of modules of the MT system in which the inferred rules are applied, in line with the method proposed by Sánchez-Martínez et al (2008) to train part-of-speech taggers in an unsupervised way for their use in MT.…”
Section: Overviewmentioning
confidence: 99%
“…Candidates for a starting point for our development could also have been one of the following tools: TriTagger [12,11], an open source trigram HMM tagger implemented in Java, has a language specific integrated morphological analyser (for Icelandic) -called IceMorphy [10] -but it does not perform lemmatization and lacks a few clever tricks implemented in HunPos, which seem to boost the performance of that tagger significantly. Another open source trigram HMM tagger [18,17] implemented in C/C++ is included in the Apertium [1] rule based machine translation toolset. It performs full morphological disambiguation by taking morphologically analyzed input and disambiguating it.…”
Section: Introductionmentioning
confidence: 99%
“…improves human-to-human communication, enables human-to-machine communication by doing useful processing of texts or speeches. Part-of-speech (POS) tagging is one of the most important addressed areas and main building block and application in the natural language processing discipline [1][2][3]. So, Part of Speech (POS) Tagging is a notable NLP topic that aims in assigning each word of a text the proper syntactic tag in its context of appearance [4][5][6][7][8].…”
mentioning
confidence: 99%