2008
DOI: 10.1016/j.sigpro.2008.06.013
|View full text |Cite
|
Sign up to set email alerts
|

Recognition of off-line printed Arabic text using Hidden Markov Models

Abstract: This paper describes a technique for automatic recognition of off-line printed Arabic text using Hidden Markov Models. In this work different sizes of overlapping and non-overlapping hierarchical windows are used to generate 16 features from each vertical sliding strip. Eight different Arabic fonts were used for testing (viz. Arial, Tahoma, Akhbar, Thuluth, Naskh, Simplified Arabic, Andalus, and Traditional Arabic). It was experimentally proven that different fonts have their highest recognition rates at diffe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
29
0

Year Published

2011
2011
2019
2019

Publication Types

Select...
6
2

Relationship

2
6

Authors

Journals

citations
Cited by 76 publications
(29 citation statements)
references
References 35 publications
0
29
0
Order By: Relevance
“…Al-Muhtaseb et al proposed a hierarchical sliding window for printed Arabic text recognition [22]. A window is divided into eight non-overlapping vertical segments.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Al-Muhtaseb et al proposed a hierarchical sliding window for printed Arabic text recognition [22]. A window is divided into eight non-overlapping vertical segments.…”
Section: Related Workmentioning
confidence: 99%
“…These hierarchical windows generate features with greater weight in the center region of the writing line (baseline). These hierarchical windows produced very high recognition rates with synthesized data [22]. Experiments with text line images extracted from scanned documents yielded poor results [23].…”
Section: Related Workmentioning
confidence: 99%
“…It can be used to extract all Arabic characters in their all forms. It was shown in [5] that when the number of samples for some Arabic shapes are not enough in the training data, the recognition rate can be improved by including 50 samples of a minimal text paragraph that covers all shapes of Arabic characters to train the HMM. The second paragraphs in this page were selected randomly from a large corpus that we developed from 46 sources to make the database a statistical representation of the corpus.…”
Section: Introductionmentioning
confidence: 99%
“…The author of PATS dataset [3], the first dataset we used to test our algorithm, uses HMM and a sliding window to segment and recognize Arabic scripts. See also the recent [2].…”
Section: Introductionmentioning
confidence: 99%