2004
DOI: 10.1007/978-3-540-28640-0_11

Segmentation of Handwritten Characters for Digitalizing Korean Historical Documents

Abstract: The historical documents are valuable cultural heritages and sources for the study of history, social aspect and life at that time. The digitalization of historical documents aims to provide instant access to the archives for the researchers and the public, who had been endowed with limited chance due to maintenance reasons. However, most of these documents are not only written by hand in ancient Chinese characters, but also have complex page layouts. As a result, it is not easy to utilize convention…

Cited by 17 publications (7 citation statements)
References 5 publications
“…The average recognition score, SA, of the segmented text image is represented by these two scores as follows: (12) where i and N denote the index and the number of characters, KG and KR are constants, and Si, SG, and SR denote the total, geometric, and recognition scores of each character, respectively. KG and KR are usually assigned as 0.3 and 5, respectively.…”
Section: Adjustment Of the Segmentation Results (mentioning)
confidence: 99%
“…The optimal separation path is determined by using two scores: (1) the geometric score that is used to estimate the likelihood of "being a character" by geometric features and (2) the recognition score obtained by the character recognition function. The geometric score is calculated by using two character evaluators, the squareness (SQU) and the internal gap (GAP), which are estimated by the Parzen window [12]. If the geometric score is low, the sub-image is eliminated in the separation paths and not recognized, by which the overhead of recognition can be reduced.…”
Section: Adjustment Of the Segmentation Results (mentioning)
confidence: 99%
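
The citation statement above mentions estimating geometric evaluators with a Parzen window. As a rough illustration only, the sketch below shows a one-dimensional Gaussian-kernel Parzen window density estimate and one way two such estimates might be combined into a geometric score. The function names, the squareness definition (shorter side over longer side), the product combination, and the bandwidth defaults are all assumptions made for this example; the quoted text does not specify them.

```python
import math


def parzen_density(x, samples, bandwidth):
    """Gaussian-kernel Parzen window estimate of the density at x."""
    if not samples or bandwidth <= 0:
        raise ValueError("need at least one sample and a positive bandwidth")
    norm = 1.0 / (len(samples) * bandwidth * math.sqrt(2.0 * math.pi))
    return norm * sum(
        math.exp(-0.5 * ((x - s) / bandwidth) ** 2) for s in samples
    )


def squareness(width, height):
    """Hypothetical squareness measure: shorter side / longer side, in (0, 1]."""
    return min(width, height) / max(width, height)


def geometric_score(width, height, gap, squ_samples, gap_samples,
                    h_squ=0.1, h_gap=1.0):
    """Assumed combination: product of the two Parzen density estimates.

    squ_samples and gap_samples stand in for squareness and internal-gap
    values measured on known-good characters (hypothetical training data).
    """
    p_squ = parzen_density(squareness(width, height), squ_samples, h_squ)
    p_gap = parzen_density(gap, gap_samples, h_gap)
    return p_squ * p_gap
```

Under these assumptions, a candidate sub-image whose squareness and internal gap fall close to the values seen in known characters receives a higher score, and a low-scoring candidate could be dropped before any recognition step is run.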
“…Digitization of historical archives and application of information retrieval methods on them have gained pace in recent decades, including non-European handwritten archival collections [11]. Some historical documents might have a tabular structure, which makes it easier to analyze the layout.…”
Section: Related Work (mentioning)
confidence: 99%
“…Recently, some research proposed supporting methods and systems to decode historical documents [1]-[3]. We also proposed a system to help archeologists decode mokkans.…”
Section: Damaged Too Much (mentioning)
confidence: 99%