Proceedings of the 2nd International Conference on Big Data, Cloud and Applications 2017
DOI: 10.1145/3090354.3090371
|View full text |Cite
|
Sign up to set email alerts
|

Developing and performance evaluation of a new Arabic heavy/light stemmer

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 24 publications
0
5
0
Order By: Relevance
“…In the second step, the root frequency map is constructed to derive the possible stem by minimizing the frequency map of roots. Zeroual [53] used pattern-matching techniques to derive the stem of the Arabic word. For instance, the words ‫ﮐتﺐ‬ [kataba/to write], ‫ﮐاتﺐ‬ [kaatib/writer], ‫مکتوب‬ [makotuwb/written],…”
Section: Related Workmentioning
confidence: 99%
“…In the second step, the root frequency map is constructed to derive the possible stem by minimizing the frequency map of roots. Zeroual [53] used pattern-matching techniques to derive the stem of the Arabic word. For instance, the words ‫ﮐتﺐ‬ [kataba/to write], ‫ﮐاتﺐ‬ [kaatib/writer], ‫مکتوب‬ [makotuwb/written],…”
Section: Related Workmentioning
confidence: 99%
“…After reviewing and filtrating the compiled lists, we created a new list of roughly 1,000 domain-independent stop words. Then, we generated their inflected forms following a proposed technique that involves 123 Arabic clitics [18]. As a result, the final list comprises 11,403 stop words.…”
Section: Stop Words Eliminationmentioning
confidence: 99%
“…They executed some experiments and evaluated the performance of the developed algorithm based on two styles of Arabic; Classical Arabic and Modern Standard Arabic. The outputs of the stemmer organized in three classes include the stem, a unique root, and a combined class from the root and stem [24].…”
Section: Literature Reviewmentioning
confidence: 99%
“…Heavy/light Stemmer [24] Stemmer algorithm based on Arabic's morphological features. 96.9% Stemming Algorithm [22] Based on a set of rules and a file containing Arabic roots, the authors extract roots from input texts.…”
Section: %mentioning
confidence: 99%