Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing - HLT '05 2005
DOI: 10.3115/1220575.1220635
|View full text |Cite
|
Sign up to set email alerts
|

Context-based morphological disambiguation with random fields

Abstract: Finite-state approaches have been highly successful at describing the morphological processes of many languages. Such approaches have largely focused on modeling the phone-or character-level processes that generate candidate lexical types, rather than tokens in context. For the full analysis of words in context, disambiguation is also required (Hakkani-Tür et al., 2000;Hajič et al., 2001). In this paper, we apply a novel source-channel model to the problem of morphological disambiguation (segmentation into mor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
27
0

Year Published

2006
2006
2020
2020

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 25 publications
(27 citation statements)
references
References 21 publications
0
27
0
Order By: Relevance
“…The method is also compared to an SVM based segmenter presented by Diab et al (2004) and shows improved results for small tasks but, again, no or only little improvement for large tasks. Nguyen and Vogel (2008) apply a Conditional Random Fields (CRF) segmentation method (as presented in Smith et al 2005) for Arabic-to-English translation. They show that a reduced morpheme segmentation, i.e.…”
Section: Related Workmentioning
confidence: 99%
“…The method is also compared to an SVM based segmenter presented by Diab et al (2004) and shows improved results for small tasks but, again, no or only little improvement for large tasks. Nguyen and Vogel (2008) apply a Conditional Random Fields (CRF) segmentation method (as presented in Smith et al 2005) for Arabic-to-English translation. They show that a reduced morpheme segmentation, i.e.…”
Section: Related Workmentioning
confidence: 99%
“…In subsequent work, Sutton et al [11] observe that jointly modeling noun phrase (NP) segmentation and POS tagging using DCRFs improves POS prediction accuracies. POS tagging using CRFs has also been applied to languages such as Chinese [77], Arabic, Czech [78], and Japanese [79], with complex morphological structures, and they have been found to outperform their generative counterparts.…”
Section: Part Of Speech Tagging S: Part Of Speech Tags O: Words In a mentioning
confidence: 99%
“…The standard solution to constrain the probabilistic tagging model for some of the unseen items is the application of MA (Hakkani-Tür et al, 2000;Hajič et al, 2001;Smith et al, 2005). Here a distinction must be made between those items that are not found in the training corpus (these we have called unseen tokens) and those that are not known to the MA -we call these out of vocabulary (OOV).…”
Section: The Disambiguation Of Morphological Analysesmentioning
confidence: 99%