2016
DOI: 10.21608/ijci.2016.33954
|View full text |Cite
|
Sign up to set email alerts
|

A Comparative Study for Arabic Text Classification Based on BOW and Mixed Words Representations

Abstract: This paper compares two methods for features representation in Arabic text classification. These methods are bag of words (BOW) that mean the word-level unigram and mixed words representations. The mixed words use a mixture of a bag of words and two adjacent words with different proportions. The main objective of this paper is to measure the accuracy of each method and to determine which method is more accurate for Arabic text classification based on the representation modes. Each method uses normalization and… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 14 publications
0
1
0
Order By: Relevance
“…The authors, in [18], compare two feature extraction techniques in Arabic text classification: BoW and mixedwords. The results show that the mixed-words technique achieves the highest accuracy when normalization is used.…”
Section: Related Workmentioning
confidence: 99%
“…The authors, in [18], compare two feature extraction techniques in Arabic text classification: BoW and mixedwords. The results show that the mixed-words technique achieves the highest accuracy when normalization is used.…”
Section: Related Workmentioning
confidence: 99%