2013
DOI: 10.5121/ijdkp.2013.3401

Effective Arabic Stemmer Based Hybrid Approach for Arabic Text Categorization

Abstract: Text pre-processing of the Arabic language is a challenging and crucial stage in Text Categorization (TC) in particular and Text Mining (TM) in general. Stemming algorithms can be employed in Arabic text pre-processing to reduce words to their stems or roots. Arabic stemming algorithms can be grouped into three categories: the root-based approach (e.g., Khoja), the stem-based approach (e.g., Larkey), and the statistical approach (e.g., N-Gram). However, no stemmer for this language is perfect: the existing stemmers have a small effi…

Cited by 24 publications
(12 citation statements)
references
References 16 publications
“…This was created by Khoja to solve the problems raised by the stemming tool (root extraction), [2], [6], [22]. According to some of previous studies, light stemming is a good stemmer for Arabic text [2], [6], [7].…”
Section: Feature Extraction (mentioning)
confidence: 99%
“…Recent years, a lot of non-English mixed stemming methods are proposed, for example, a hybrid algorithm for Polish proposed by Dawid Weis [19], hybrid algorithm for Nepali stemming proposed by Chiranjibi Sitaula [20], hybrid inflectional stemmer for Gujarati proposed by Suba, Jiandani and Bhattacharyya [21], hybrid stemmer for Arabic proposed by Hadni, Ouatik and Lachkar [22 ].…”
Section: Mixed Methods (mentioning)
confidence: 99%
“…Recently, Hadni et al team [7] presents an Effective Arabic Stemmer Based Hybrid Approach for Arabic Text Categorization. Note that, in any Text Categorization system the center point is the document and its representation that may impact positively or negatively on the accuracy of the system.…”
Section: Related Work (mentioning)
confidence: 99%
“…Figure .1 Architecture of TC System of predefined categories [6]. Several Text Categorization Systems have been conducted for yet very little researches have been done out for the Arabic Text Categorization [7]. Arabic language is a highly inflected language and it requires a processing to be manipulated, it is a Semitic language that has a very complex ared with English.…”
Section: Introduction (mentioning)
confidence: 99%