2020
DOI: 10.48550/arxiv.2008.05373
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Attention-based Fully Gated CNN-BGRU for Russian Handwritten Text

Abdelrahman Abdallah,
Mohamed Hamada,
Daniyar Nurseitov

Abstract: This research approaches the task of handwritten text with attention encoderdecoder networks that are trained on Kazakh and Russian language. We developed a novel deep neural network model based on Fully Gated CNN, supported by Multiple bidirectional GRU and Attention mechanisms to manipulate sophisticated features that achieve 0.045 Character Error Rate (CER), 0.192 Word Error Rate (WER) and 0.253 Sequence Error Rate (SER) for the first test dataset and 0.064 CER, 0.24 WER and 0.361 SER for the second test da… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 29 publications
(35 reference statements)
0
2
0
Order By: Relevance
“…In the case of Eastern European languages, the Kazakh Offline Handwritten Text Dataset (KOHTD) (Toiganbayeva et al ., 2022), a dataset of Cyrillic exam papers, and the Handwritten Kazakh and Russian (HKR) database, have recently emerged (Nurseitov et al ., 2021). These databases are essential for training and the eventual recognition of texts, allowing these materials to be consulted and analysed for the first time, as exemplified by Abdallah et al . (2020, p. 141) producing models with CERs as low as 0.045% and WERs as low as 0.192% using the HKR dataset.…”
Section: Resultsmentioning
confidence: 99%
“…In the case of Eastern European languages, the Kazakh Offline Handwritten Text Dataset (KOHTD) (Toiganbayeva et al ., 2022), a dataset of Cyrillic exam papers, and the Handwritten Kazakh and Russian (HKR) database, have recently emerged (Nurseitov et al ., 2021). These databases are essential for training and the eventual recognition of texts, allowing these materials to be consulted and analysed for the first time, as exemplified by Abdallah et al . (2020, p. 141) producing models with CERs as low as 0.045% and WERs as low as 0.192% using the HKR dataset.…”
Section: Resultsmentioning
confidence: 99%
“…A new attention-based fully gated convolutional RNN was proposed by Abdallah [24], this model was trained and tested on the HKR dataset [4]. This work shows the effect of the attention mechanism and the gated layer on selecting relative features.…”
Section: Related Workmentioning
confidence: 99%