2012
DOI: 10.5120/7500-0634
|View full text |Cite
|
Sign up to set email alerts
|

DACS Dewey index-based Arabic Document Categorization System

Abstract: This paper is devoted to the development of Arabic Text Categorization System. First, a stop-words list is generated using statistical approach which captures the inflation of different Arabic words. Second, a feature representation model based on Hidden Markov Model is developed to extract roots and morphological weights. Third, a semantic synonyms merge technique is presented for feature reduction. Finally a Dewey-Index Based Back-propagation Artificial Neural Network is developed for Arabic Document Categor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 20 publications
0
2
0
Order By: Relevance
“…Stop Words are words that have no semantic relationship with the context in which they are located. Therefore, they should not be included as indexing terms (Alajmi et al, 2012). Thus, Stop Words, which is a basic tool in the text mining process, is the process of removing frequently used words from the texts during data preprocessing.…”
Section: Discussionmentioning
confidence: 99%
“…Stop Words are words that have no semantic relationship with the context in which they are located. Therefore, they should not be included as indexing terms (Alajmi et al, 2012). Thus, Stop Words, which is a basic tool in the text mining process, is the process of removing frequently used words from the texts during data preprocessing.…”
Section: Discussionmentioning
confidence: 99%
“…We want to clarify some concepts to provide an autonomous document as much as possible. Empty words (a.k.a stop words) are those words that have no significant semantic relation to the context in which they exist (Alajmi et al , 2012). Since empty words do not provide meaning to a sentence, they should be deleted from the text.…”
Section: Methodsmentioning
confidence: 99%