Computer Science &Amp; Information Technology (CS &Amp; IT) 2020
DOI: 10.5121/csit.2020.101506
|View full text |Cite
|
Sign up to set email alerts
|

An Efficient Language-Independent Multi-Font OCR for Arabic Script

Abstract: Optical Character Recognition (OCR) is the process of extracting digitized text from images of scanned documents. While OCR systems have already matured in many languages, they still have shortcomings in cursive languages with overlapping letters such as the Arabic language. This paper proposes a complete Arabic OCR system that takes a scanned image of Arabic Naskh script as an input and generates a corresponding digital document. Our Arabic OCR system consists of the following modules: Pre-processing, Word-le… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 17 publications
0
3
0
Order By: Relevance
“…The accuracy for the letters Aeen, Taa, Haa, Laam, and Waaw is perfect and has reached 100% to classify in sizes from 14 to 72 for all fonts. The small font sizes (8)(9)(10)(11)(12)(13)(14) and the accuracy range between 60-90% in all Arabic fonts Daal letter classification accuracy is in the range of 30-80% for sizes 8-14 and 90-100% for sizes 24-72. Raa letter classification accuracy is 20-70% for sizes (8)(9)(10)(11)(12)(13)(14)(15)(16) and 90% for sizes to all Arabic fonts.…”
Section: Cmfmentioning
confidence: 96%
See 1 more Smart Citation
“…The accuracy for the letters Aeen, Taa, Haa, Laam, and Waaw is perfect and has reached 100% to classify in sizes from 14 to 72 for all fonts. The small font sizes (8)(9)(10)(11)(12)(13)(14) and the accuracy range between 60-90% in all Arabic fonts Daal letter classification accuracy is in the range of 30-80% for sizes 8-14 and 90-100% for sizes 24-72. Raa letter classification accuracy is 20-70% for sizes (8)(9)(10)(11)(12)(13)(14)(15)(16) and 90% for sizes to all Arabic fonts.…”
Section: Cmfmentioning
confidence: 96%
“…Hussein Osman et al (2020) [12] Present a neural network model. The system has been tested on several open Arabic corpora datasets and has achieved outstanding results when compared to state-of-the-art Arabic OCR systems, with an average character segmentation accuracy of 98.06%, character recognition accuracy of 99.89%, and an overall system accuracy of 97.94%.…”
Section: Introductionmentioning
confidence: 99%
“…Less number of classes with de-shaped samples up to a certain limit (per class) is a way towards model simplicity. Reference [23] also used a heuristic method for segmentation with previously mentioned limitations. Such contemporary methods may work on clean data that is unfeasible for applications like Arabic news ticker recognition.…”
Section: Ticker Segmentationmentioning
confidence: 99%