2009
DOI: 10.3844/jcssp.2009.794.800
|View full text |Cite
|
Sign up to set email alerts
|

Statistical Part-of-Speech Tagger for Traditional Arabic Texts

Abstract: Problem statement:This study presented the development of an Arabic part-of-speech tagger that can be used for analyzing and annotating traditional Arabic texts, especially the Quran text. Approach: It is a part of a project related to the computerization of the Holy Quran. One of the main objectives in this project was to build a textual corpus of the Holy Quran. Results: Since an appropriate textual version of the Holy Quran was prepared and morphologically analyzed in other stages of this project, we focuse… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2011
2011
2024
2024

Publication Types

Select...
8
2

Relationship

0
10

Authors

Journals

citations
Cited by 17 publications
(6 citation statements)
references
References 5 publications
0
6
0
Order By: Relevance
“…The problems of Arabic studies in POS tagging are as follows (El-Hadj, 2009;Al Gahtani et al, 2009):…”
Section: Pos Tagging Approaches Used For Arabicmentioning
confidence: 99%
“…The problems of Arabic studies in POS tagging are as follows (El-Hadj, 2009;Al Gahtani et al, 2009):…”
Section: Pos Tagging Approaches Used For Arabicmentioning
confidence: 99%
“…Finally, it builds an HMM-based model of Arabic POS tags, which will be trained on the annotated corpus. Alhadj et al [30] propose a new method of part-of-speech tagger that can be used for analyzing and annotating traditional Arabic texts, especially the Holy Quran text.…”
Section: Statistical Approachmentioning
confidence: 99%
“…Part-Of-Speech (POS) tagging is assigning a specific tag to each word of a sentence to indicate its function in the specific context [1]. POS tagging is considered as one of the basic components necessary for any robust Natural Language Processing (NLP) infrastructure [2], and it is needed in many tasks such as syntax and semantic analysis, text to speech (TTS), natural language parsing, information retrieval (IR), information extraction (IE), and machine translation (MT) [3]. A manually tagged corpus can be used for innumerable studies of word-frequency and POS.…”
Section: Introductionmentioning
confidence: 99%