2020
DOI: 10.48550/arxiv.2007.03579
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

HKR For Handwritten Kazakh & Russian Database

Daniyar Nurseitov,
Kairat Bostanbekov,
Daniyar Kurmankhojayev
et al.

Abstract: In this paper, we present a new Russian and Kazakh database (with about 95% of Russian and 5% of Kazakh words/sentences respectively) for offline handwriting recognition. A few pre-processing and segmentation procedures have been developed together with the database. The database is written in Cyrillic and shares the same 33 characters. Besides these characters, the Kazakh alphabet also contains 9 additional specific characters. This dataset is a collection of forms. The sources of all the forms in the dataset… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
8
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
3
1

Relationship

4
0

Authors

Journals

citations
Cited by 4 publications
(8 citation statements)
references
References 28 publications
0
8
0
Order By: Relevance
“…In this research, our contribution is to present a novel attention-based fully gated convolutional recurrent neural network, trained in Kazakh and Russian dataset [25], it will be processed as follows:…”
Section: Previous Approaches To the Offline Handwriting Text Recognit...mentioning
confidence: 99%
See 1 more Smart Citation
“…In this research, our contribution is to present a novel attention-based fully gated convolutional recurrent neural network, trained in Kazakh and Russian dataset [25], it will be processed as follows:…”
Section: Previous Approaches To the Offline Handwriting Text Recognit...mentioning
confidence: 99%
“…Our work is the first research paper in our HKR Dataset [25], this dataset is first open-source in Russian and Kazakh handwritten datasets until now there is no dataset in theses language available to the researchers…”
Section: Related Workmentioning
confidence: 99%
“…Deep learning has recently achieving state-of-the-art using convolutional neural network (CNN) [9] in many tasks including object detection [10], face recognition [11], sequence to sequence learning [12,13], speech recognition [14], semantic segmentation [15], image classification [16], handwritten recognition [17,18,19], and table detection [1,8,6] is demanding because they need to classify tables among the texts and other figures. The presence of split columns or rows, as well as nested tables or embedded figures, makes the detection of a table even more difficult.…”
Section: Introductionmentioning
confidence: 99%
“…Kernel principal component analysis (KPCA) was calculated from urinary organic (nuclear magnetic resonance) and inorganic (inductively coupled plasma optical emission spectrometry) data. [4,5,6] Along these same lines, existing pseudorandom and heterogeneous algorithms use adaptive information to cache link-level acknowledgments. The emulation of Internet QoS would tremendously improve robust technology.…”
Section: Introductionmentioning
confidence: 99%