2019
DOI: 10.1007/s00521-019-04163-3
|View full text |Cite
|
Sign up to set email alerts
|

ARDIS: a Swedish historical handwritten digit dataset

Abstract: This paper introduces a new image-based handwritten historical digit dataset named Arkiv Digital Sweden (ARDIS). The images in ARDIS dataset are extracted from 15,000 Swedish church records which were written by different priests with various handwriting styles in the nineteenth and twentieth centuries. The constructed dataset consists of three single-digit datasets and one-digit string dataset. The digit string dataset includes 10,000 samples in red-green-blue color space, whereas the other datasets contain 7… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
28
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
4
1

Relationship

1
9

Authors

Journals

citations
Cited by 56 publications
(28 citation statements)
references
References 52 publications
0
28
0
Order By: Relevance
“…The first phase of methodology comprises the MSTAR image dataset collection and CNN is applied for the image classification on SAR images. The detailed architecture for the CNN model has been influenced from [16] and [17] with empirical modifications for best possible validation. The architecture diagram of proposed CNN is depicted in figure 3.…”
Section: Proposed Methodsmentioning
confidence: 99%
“…The first phase of methodology comprises the MSTAR image dataset collection and CNN is applied for the image classification on SAR images. The detailed architecture for the CNN model has been influenced from [16] and [17] with empirical modifications for best possible validation. The architecture diagram of proposed CNN is depicted in figure 3.…”
Section: Proposed Methodsmentioning
confidence: 99%
“…In a similar method proposed by Romero et al [20], the heuristic line segmentation is further replaced with an HMM-based approach. Moving from HMM to convolutional approaches, Chammas et al [21] use convolutional neural network (CNN) features in combination with long short-term memory (LSTM) cells and connectionist temporal classification (CTC) to align text line images with their transcription. In their work, line segmentation is achieved heuristically by means of contour distribution analysis.…”
Section: Transcription Alignmentmentioning
confidence: 99%
“…Moreover, in this project, the USPS test dataset of 2007 samples are also used for testing purposes. USPS [47,48] comprises 7291 training samples and 2007 testing samples in grayscale for the digits 0 to 9.…”
Section: Used Datasetsmentioning
confidence: 99%