2016
DOI: 10.18038/btda.31812
|View full text |Cite
|
Sign up to set email alerts
|

A hybrid Statistical Approach to Stemming in Turkish: An Agglutinative Language

Abstract: Finding Stem is a complicated and important issue for agglutinative languages like Turkish where theoretically infinite number of surface forms can be obtained from a single lexeme. Both analytical and statistical approaches have been tried for stemming Turkish words. Two main problems that become apparent with these approaches are the involvement of a dictionary which enforces the assumption of closed vocabulary and the disambiguation of the actual stem among the numerous candidates. Here, we present a method… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
3
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 14 publications
0
3
0
Order By: Relevance
“…According to the study, it is possible to derive about 1.5 million different words from a Noun [masa (table)] and from a Verb [oku (read)] only with the use of derivational morphemes [40]. The morphological structure of Turkish word is shown in Figure 1 [41].…”
Section: Turkish Language Modelling Challenges Based On Its Morphological Complexitymentioning
confidence: 99%
See 2 more Smart Citations
“…According to the study, it is possible to derive about 1.5 million different words from a Noun [masa (table)] and from a Verb [oku (read)] only with the use of derivational morphemes [40]. The morphological structure of Turkish word is shown in Figure 1 [41].…”
Section: Turkish Language Modelling Challenges Based On Its Morphological Complexitymentioning
confidence: 99%
“…Some samples for morphological productivity of Turkish language are provided in Table 1 [41]. As it is obvious from Table 1, the number of suffixes and their imaginable combinations that can be added to a word generate a serious language analysis problem to obtain actual stem from possible derivations.…”
Section: Turkish Language Modelling Challenges Based On Its Morphological Complexitymentioning
confidence: 99%
See 1 more Smart Citation