2020
DOI: 10.1111/exsy.12565
|View full text |Cite
|
Sign up to set email alerts
|

Deep OCR for Arabic script‐based language like Pastho

Abstract: Developing cursive script recognition systems have always been a challenging task for researchers. This article proposes a ligature-based recognition system for the cursive Pashto script using four pre-trained CNN models using a fine-tuned approach. The SqueezeNet, ResNet, MobileNet and DenseNet models have been observed for the classification and the recognition of Pashto sub-word (ligature). Overall, the proposed system is divided into two domains (Source and Target). The source domain contains the pre-train… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 20 publications
0
2
0
Order By: Relevance
“…The architecture includes four convolutional layers, two max pooling layers, and a fully connected layer with a dropout of 20% of the nodes. Naz et al [82] proposed Pashto ligature (subword) OCR approach. The approach achieved 99.31% using the DenseNet CNN architecture.…”
Section: Learned Featuresmentioning
confidence: 99%
“…The architecture includes four convolutional layers, two max pooling layers, and a fully connected layer with a dropout of 20% of the nodes. Naz et al [82] proposed Pashto ligature (subword) OCR approach. The approach achieved 99.31% using the DenseNet CNN architecture.…”
Section: Learned Featuresmentioning
confidence: 99%
“…OCR process converts image of printed or handwritten text into digital text that can be modified, processed, searched, and copied [7]. Although there are still several shortcomings of the technology that need to be treated and resolved to raise accuracy, OCR is a mesmerizing technology that has shouldered computers to digitize texts [8]. The manipulation process includes segmentation image into lines, words and parts of words, thus segmenting the word image into images of characters.…”
Section: Introductionmentioning
confidence: 99%