2021
DOI: 10.1109/access.2021.3110787
|View full text |Cite
|
Sign up to set email alerts
|

MMU-OCR-21: Towards End-to-End Urdu Text Recognition Using Deep Learning

Abstract: Optical Character Recognition (OCR) is a technique that generates text from an image. Recognizing the importance of OCR in real-world settings, a plethora of techniques have been developed for Western, as well as Asian languages. Urdu is a prominent South Asian language and a number of different solutions for Urdu OCR have been proposed. However, fewer attempts have been made to develop end-to-end deep learning-based solutions for recognizing printed Urdu text. Furthermore, several benchmark corpora for Urdu O… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0
1

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 19 publications
(10 citation statements)
references
References 58 publications
(61 reference statements)
0
9
0
1
Order By: Relevance
“…The episodic framework was introduced by Nasir et al (2021). It provides a simulation for training a meta-learning model for few-shot classification tasks.…”
Section: Episodic Frameworkmentioning
confidence: 99%
See 1 more Smart Citation
“…The episodic framework was introduced by Nasir et al (2021). It provides a simulation for training a meta-learning model for few-shot classification tasks.…”
Section: Episodic Frameworkmentioning
confidence: 99%
“…The challenge of converting manuscripts and printed documents into digital formats has been the focus of computer vision research (Nasir et al, 2021;Gowda and Kanchana, 2022). Recent advances have blurred the interface between physical copies of text and their digital counterparts.…”
Section: Introductionmentioning
confidence: 99%
“…The architecture is essentially an end-to-end fully convolutional network (FCN), and in essence, it only contains convolutional layers and no dense layers, enabling it to be used with images of varying sizes [24]. In Figure 8 given input of an image of a hand-filled form, a trained semantic segmentation algorithm can output a segmentation map, a blank image wherein only the areas of interest (in this case, relevant text) are highlighted in black [25]. This segmentation map is crucial in understanding which areas of an image pertain to the major class in consideration.…”
Section: ) Relevant Block Identification and Isolation Modulementioning
confidence: 99%
“…Pengimplementasian OCR dapat dilakukan diberbagai ranah penelitian seperti, structural text form recognition [6], automatic number plate recognition [7], alat bantu baca tuna netra, pengenalan tulisan tangan (handwritten) dan sebagainya. OCR juga dapat digunakan untuk melestarikan dan mendigitalkan script [3] atau sastra kuno sebagai warisan [1] [8]. Identifikasi script merupakan langkah penting dalam pemrosesan gambar dokumen terutama ketika lingkungan multiskrip atau multibahasa [9].…”
Section: Pendahuluanunclassified