2019
DOI: 10.1016/j.procs.2019.09.238
|View full text |Cite
|
Sign up to set email alerts
|

Comparative Study of Arabic Stemming Algorithms for Topic Identification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
4
0
2

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(6 citation statements)
references
References 3 publications
0
4
0
2
Order By: Relevance
“…However, three recent comparative studies investigated different Arabic stemmers in the field of Arabic text classification. The first one [21] reported that the classification was more efficient with Tashaphyne light based stemmer 6 followed by Farasa [22], Khoja stemmer [8], Light10 [7], and finally Al Khalil Morph Sys [23]. What's more, this study found that the stemmer accuracy does not have an impact on the classifier efficiency in topic identification.…”
Section: Stemmingmentioning
confidence: 64%
“…However, three recent comparative studies investigated different Arabic stemmers in the field of Arabic text classification. The first one [21] reported that the classification was more efficient with Tashaphyne light based stemmer 6 followed by Farasa [22], Khoja stemmer [8], Light10 [7], and finally Al Khalil Morph Sys [23]. What's more, this study found that the stemmer accuracy does not have an impact on the classifier efficiency in topic identification.…”
Section: Stemmingmentioning
confidence: 64%
“…The stemming process is an important pre-processing phase that, depending on the language employed, might be considered a tough step to complete. The amount of morphological complexity of a language can impact stemming outcomes [25]. Because the NLTK library [26], which is used for the stemming process, does not currently support Indonesian, the stemming process in Indonesian is carried out using the Sastrawi library [27], which has proved to be fairly competent in handling the Indonesian language stemming process.…”
Section: Data Pre-processingmentioning
confidence: 99%
“…According to our review, the commonly stemmers that have been successfully implemented in Arabic topic identification [26][27][28] are ARLSTem, Tashaphyne light stemmer, Farasa, Khoja stemmer, Light10, Al Khalil Morph Sys, Assem's stemmer, Soori's stemmer, and ISRI stemmer.…”
Section: Stemmingmentioning
confidence: 99%