2019
DOI: 10.1109/access.2019.2924449

A Human-Inspired Recognition System for Pre-Modern Japanese Historical Documents

Abstract: Recognizing historical documents is challenging because of noisy, damaged characters and backgrounds. Japanese historical documents pose a further problem: pre-modern Japanese characters were written in cursive and are connected, so character segmentation-based methods do not work well. This motivates a new recognition system. In this paper, we propose a human-inspired document reading system to recognize multiple lines of premodern Japan…

Cited by 16 publications (12 citation statements)
References 14 publications

“…Predicting Ordering and Layout in Historical Japanese Documents: The closest work that tried to determine Kuzushiji reading order was done by [7]. Human reading behavior was defined as people determining the start character of a paragraph/line, with the idea that they move the eyes from the current character/word to the next character/word.…”
Section: Related Work (mentioning)
confidence: 99%
“…It contains two modules: a DenseNets for feature extraction and an LSTM Decoder with an attention model for generating the target characters. We employed a similar setting for the system as our previous works [4]. The advantage of our model is that it requires images and corresponding transcriptions without bounding boxes of characters.…”
Section: Overview Of Human Inspired Recognition System (mentioning)
confidence: 99%
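
The attention step mentioned in the statement above can be illustrated with a minimal sketch. It assumes plain dot-product scoring and toy list-based vectors; the excerpt does not say which attention form the cited system uses, and the names `softmax` and `attend` are hypothetical. The sketch only shows how a decoder state could weight the extracted feature vectors to form a context vector before emitting the next character.

```python
import math


def softmax(scores):
    """Turn raw scores into weights that sum to 1 (numerically stable)."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(features, query):
    """Dot-product attention (an assumed scoring function, not the paper's).

    features: list of feature vectors, e.g. one per image location.
    query:    current decoder state, same dimension as each feature vector.
    Returns the attention weights and the weighted-sum context vector.
    """
    scores = [sum(f_i * q_i for f_i, q_i in zip(f, query)) for f in features]
    weights = softmax(scores)
    dim = len(features[0])
    context = [sum(w * f[d] for w, f in zip(weights, features)) for d in range(dim)]
    return weights, context


# Toy usage: two feature vectors; the query is aligned with the first one.
weights, context = attend([[1.0, 0.0], [0.0, 1.0]], [2.0, 0.0])
```

In this toy case the first feature vector receives the larger weight because it has the larger dot product with the query. A real system would compute the features with a CNN and the query with an LSTM, which this sketch leaves out.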
“…Figure 7 shows the process of generating multiple lines dataset. In the previous work [4], we were able to train the recognition on multiple lines. Recognition of multiple lines is easier than that of full-page documents.…”
Section: Curriculum Learning For Human-inspired Recognition System (mentioning)
confidence: 99%
“…Text recognition is the process of converting the text in one or more images into documents such as those written on a computer [1][2][3]. Currently, the word recognition problem has been studied and widely used in the automation of office operations in many languages like English [4,5], Chinese [6,7], Japanese [8]. In Vietnam, there are some identification systems such as VietOCR software [9] and VNDOCR [10].…”
Section: Introduction (mentioning)
confidence: 99%