2018 IEEE 12th International Conference on Application of Information and Communication Technologies (AICT) 2018
DOI: 10.1109/icaict.2018.8747021
|View full text |Cite
|
Sign up to set email alerts
|

Lexicon-free stemming for Kazakh language information retrieval

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
1
0
1

Year Published

2019
2019
2022
2022

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(2 citation statements)
references
References 3 publications
0
1
0
1
Order By: Relevance
“…Misalnya, kata " cars, car's, car " memiliki bentuk dasar yang sama yaitu "car". Permasalahan utama dalam proses stemming adalah bagaimana cara memperoleh kata dasar yang benar dari suatu kata yang telah mengalami perubahan bentuk [2], [3].…”
Section: Pendahuluanunclassified
“…Misalnya, kata " cars, car's, car " memiliki bentuk dasar yang sama yaitu "car". Permasalahan utama dalam proses stemming adalah bagaimana cara memperoleh kata dasar yang benar dari suatu kata yang telah mengalami perubahan bentuk [2], [3].…”
Section: Pendahuluanunclassified
“…The first steps are removing URLs, punctuation, and lower-casing. The second step is ignoring stopwords [8] from the dataset where it is based on accuracy evaluation after generating the list of stop words using the TF-IDF algorithm; Then, we applied the stemming algorithm [7,9] which is based on Uzbek words' endings' electronic dictionary that uses combinatorial approach inferring apply for part of speech of the Uzbek language: nouns, adjectives, numerals, verbs, participles, moods, voices. Advantages of using the algorithm are lexicon-free and its complexity that allows one operation (referring to the dictionary of endings of the language) to perform: segmentation of the word into suffixes; performs morphological analysis of the word.…”
Section: Introductionmentioning
confidence: 99%