2022
DOI: 10.21203/rs.3.rs-1514700/v1
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Conv-Transformer Architecture for Unconstrained Off-LineUrdu Handwriting Recognition

Abstract: Unconstrained off-line handwriting text recognition in general and for Arabic-like scripts in particular is a challenging task and is still an active researcharea. Transformer based models for English handwriting recognition have recently shown promising results.In this paper, we have explored the use of transformerarchitecture for Urdu handwriting recognition. The useof a Convolution Neural Network before a vanilla fullTransformer and using Urdu printed text-lines alongwith handwritten text lines during the t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 0 publications
0
3
0
Order By: Relevance
“…Experiments were performed using a 1-dimensional Bi-Directional LSTM to classify individual character labels. Other works such as [35] use Transformer architectures with a CNN backbone for unconstrained character classification.…”
Section: Related Work a Urdu Handwritten Character Recognitionmentioning
confidence: 99%
See 1 more Smart Citation
“…Experiments were performed using a 1-dimensional Bi-Directional LSTM to classify individual character labels. Other works such as [35] use Transformer architectures with a CNN backbone for unconstrained character classification.…”
Section: Related Work a Urdu Handwritten Character Recognitionmentioning
confidence: 99%
“…The lack of properly structured and annotated data has always been a major hindrance for these types of tasks. Most recent methods [2], [35], [28] that involve the use of data-intensive and hungry architectures like LSTMs and Transformers usually are unable to perform when provided with sparse datasets. With continually changing styles and patterns, the Urdu language provides just that in the real world.…”
Section: Related Work a Urdu Handwritten Character Recognitionmentioning
confidence: 99%
“…Here the output is the weighted sum of the value vector, where weight is computed using the softmax function. Attention is defined as shown in equation ( 2) (40).…”
Section: Vision Transformer Architecturementioning
confidence: 99%