2022
DOI: 10.1109/access.2022.3216881
|View full text |Cite
|
Sign up to set email alerts
|

PHTI: Pashto Handwritten Text Imagebase for Deep Learning Applications

Abstract: Document Image Analysis (DIA) is one of the research areas of Artificial Intelligence (AI) that converts document images into machine-readable codes. In DIA systems, Optical Character Recognition (OCR) plays a key role in digitizing document images. The output of an OCR system is further used in many applications including, Natural Language Processing (NLP), Sentiment Analysis, Speech Recognition, and Translation Services. However, standard datasets are an essential requirement for the development, evaluation … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
8
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
2
1

Relationship

2
7

Authors

Journals

citations
Cited by 15 publications
(8 citation statements)
references
References 33 publications
0
8
0
Order By: Relevance
“… Hussain et al (2022) published a dataset named Pashto Handwritten Text Imagebase (PHTI). The PHTI contains text line images presenting real world Pashto text in handwritten form.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“… Hussain et al (2022) published a dataset named Pashto Handwritten Text Imagebase (PHTI). The PHTI contains text line images presenting real world Pashto text in handwritten form.…”
Section: Related Workmentioning
confidence: 99%
“…This article recognizes Pashto handwritten text using MD-LSTM ( Graves & Schmidhuber, 2008 ) architecture. Several experiments figure out the best hidden layer size for the proposed MD-LSTM system as a benchmark on the PHTI dataset ( Hussain et al, 2022 ). The proposed architecture is different from the model of Graves & Schmidhuber (2008) and from the model of Ahmad et al (2016) in terms of number of hidden layer size and Tanh layer size for LSTM units.…”
Section: Introductionmentioning
confidence: 99%
“…The evaluation results show that the LSTM performs significantly better than the HMM and traditional feature extraction techniques such as SIFT, particularly in the presence of scale and rotation variations. [61]This research introduces a new dataset called KPTI for the recognition of Pashto text. It contains 17,015 images of text lines with corresponding ground truth.…”
Section: : Related Workmentioning
confidence: 99%
“…Convolutional Neural Network (CNN), which was initially developed in the Neural Network Image Processing Community (NNIPC), is a famous kind of Feed-forward Neural Network (FNN) with a profound structure and has shown outstanding simulation results in various tasks, particularly in NLP tasks, such as sentence analysis of various languages in different applications. Multiple types of CNN, including the typical model, have been an important focus of research as they can be applied to complex problems involving time-varying patterns [49][50][51]. The standard CNN involves two operations, which feature extractors, convolution and pooling; the obtained output is then associated with the following [23].…”
Section: Convolutional Neural Networkmentioning
confidence: 99%