2011
DOI: 10.1109/tasl.2010.2045240
|View full text |Cite
|
Sign up to set email alerts
|

A Stochastic Arabic Diacritizer Based on a Hybrid of Factorized and Unfactorized Textual Features

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
45
0

Year Published

2013
2013
2020
2020

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 52 publications
(45 citation statements)
references
References 5 publications
0
45
0
Order By: Relevance
“…Also, on ATB standard dataset, the proposed system achieves 0.9% improvement over the best result in literature using the same training and testing data same as evaluation in Rashwan et al (2011) was done.…”
Section: Comparison To Other Systemsmentioning
confidence: 84%
See 2 more Smart Citations
“…Also, on ATB standard dataset, the proposed system achieves 0.9% improvement over the best result in literature using the same training and testing data same as evaluation in Rashwan et al (2011) was done.…”
Section: Comparison To Other Systemsmentioning
confidence: 84%
“…There are many models for Arabic PoS tags. In this work we adopt the one in Rashwan et al (2011), which sets 62 context-free atomic units to represent all possible Arabic language PoS tags. A very rich dataset of Arabic words, extracted from different sources, is used to train the system (available on http://www.RDIeg.com/RDI/TrainingData is where to download TRN_DB_II).…”
Section: Context Class Labelsmentioning
confidence: 99%
See 1 more Smart Citation
“…Roth et al [31] enriched this analysis with the use of lemmas. Rashwan et al [32] developed an Arab stochastic diacritizer based on a hybrid approach of factored and not factored textual features. They introduced the stochastic system with dual methods to automatically diacritize plain Arabic text.…”
Section: Related Workmentioning
confidence: 99%
“…In [15], an approach was proposed which combines lexical retrieval, bigram-based and SVM-statistical prioritized techniques. In [14], the authors proposed two methods: the first uses an n-gram statistical language model along with A * lattice search while the second method attempts to segment each Arabic word into all its possible morphological constituents then proceed in a similar way as the first one. The authors reported that their second approach gives better results.…”
Section: Related Workmentioning
confidence: 99%