2004
DOI: 10.1007/978-3-540-28640-0_9

A Complete Approach to the Conversion of Typewritten Historical Documents for Digital Archives

Abstract: This paper presents a complete system that historians/archivists can use to digitize whole collections of documents relating to personal information. The system integrates tools and processes that facilitate scanning, image indexing, document (physical and logical) structure definition, document image analysis, recognition, proofreading/correction and semantic tagging. The system is described in the context of different types of typewritten documents relating to prisoners in World-War II concentratio…

citations
Cited by 9 publications
(2 citation statements)
references
References 7 publications
“…In order to store archive information data that will be processed in a certain standard format in the computer after logical classification and standardised processing, a digital archive information database must be established. The order, integrity, readability, and security of the material contained in the archives must be improved, which necessitates making full use of database technology to rationally arrange and handle the resources [6]. The last step in both the process of managing archives and realizing the potential value of using archive information resources is the development and exploitation of those resources.…”
Section: Introduction (mentioning)
confidence: 99%
“…The purpose of this step is to remove any areas (e.g., scanner borders and reconstructed paper regions) in the image that cannot possibly contain any useful information and at the same time have inconsistent characteristics with the remainder of the document area [3].…”
Section: Segmentation of Non-content Regions (mentioning)
confidence: 99%