Proceedings of the 2nd International Conference on Telecommunications and Communication Engineering 2018
DOI: 10.1145/3291842.3291900
|View full text |Cite
|
Sign up to set email alerts
|

Efficient Feature Representation Based on the Effect of Words Frequency for Arabic Documents Classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 11 publications
(4 citation statements)
references
References 14 publications
0
4
0
Order By: Relevance
“…It is the process of removing words in the sentences that do not hold any important meaning, for example, for ‫ال‬ ‫ج‬ ‫ل‬ , which means "so." e researcher in Arabic mentions a list of words in [58,59].…”
Section: Stop Word Eliminationmentioning
confidence: 99%
“…It is the process of removing words in the sentences that do not hold any important meaning, for example, for ‫ال‬ ‫ج‬ ‫ل‬ , which means "so." e researcher in Arabic mentions a list of words in [58,59].…”
Section: Stop Word Eliminationmentioning
confidence: 99%
“…The words and the frequency counts of these vectors. 10 Reference 11 offered a text classification ensemble approach, in which BOW was utilized for the extraction of features, and used the Naive Bayes classifier, linear discriminant analysis, and support vector machine; the results were promising.…”
Section: Literature Reviewmentioning
confidence: 99%
“…The outcomes demonstrated that the SVM classifier gathered with ARLStem stemmer outperforms other classifiers when increasing the features. Moreover, in [49], the authors studied the effects of stop word removal in several classifiers and feature extractions using the CNN public dataset. Chi2 was applied as a feature selection method.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Secondly, normalization transforms the characters into a standard format, remove (all non-Arabic characters, diacritics, numbers, and punctuation). Then, 1057 stop words lists were prepared to be removed from all the documents [49]. Finally, stemming was applied by reducing the word into their root/stem [50] using a novel stemmer [51].…”
Section: Preprocessingmentioning
confidence: 99%