Fifth International Conference on Information Technology: New Generations (Itng 2008) 2008
DOI: 10.1109/itng.2008.103
|View full text |Cite
|
Sign up to set email alerts
|

A Suffix Based Part-of-Speech Tagger for Turkish

Abstract: In this paper, we present a stochastic part-of-speech tagger for Turkish. The tagger is primarily developed for information retrieval purposes, but it can as well serve as a light-weight PoS tagger for other purposes. The tagger uses a well-established Hidden Markov model of the language with a closed lexicon that consists of fixed number of letters from the word endings. We have considered seven different lengths of word endings against 30 training corpus sizes. Bestcase accuracy obtained is 90.2% with 5 char… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2014
2014
2021
2021

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 10 publications
(8 citation statements)
references
References 24 publications
0
8
0
Order By: Relevance
“…We believe that morphological features of the context words will be also informative in morpheme tagging task because Eryigit et al [9] shows that using Table 8. Comparison of our model with suffix based tagger [19] and the perceptron algorithm [16] on the datasets obtained from Metu Sabancı Turkish Treebank.…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations
“…We believe that morphological features of the context words will be also informative in morpheme tagging task because Eryigit et al [9] shows that using Table 8. Comparison of our model with suffix based tagger [19] and the perceptron algorithm [16] on the datasets obtained from Metu Sabancı Turkish Treebank.…”
Section: Discussionmentioning
confidence: 99%
“…Train Set Size Test Set Size Accuracy Suffix-based tagger [19] 5025 1017 84.25% Perceptron [16] 5025 1017 85.15% HMM with the last morpheme tag 5025 1017 88.98% Suffix-based tagger [19] 18205 1017 88.90% Perceptron [16] 18205 1017 86.71% HMM with the last morpheme tag 18205 1017 90.95%…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…Before starting the analysis, the word is PoS tagged within the context of a sentence using suffix-based hidden Markov model [27]. In this example, we assume that PoS tag of the word is noun.…”
Section: Stripping Off the Suffixesmentioning
confidence: 99%