Proceedings of the Fourth International Conference on Document Analysis and Recognition
DOI: 10.1109/icdar.1997.620628
|View full text |Cite
|
Sign up to set email alerts
|

Representing OCRed documents in HTML

Abstract: OCR is an error-prone process. It is time-consuming and expensive t o m a n ually proofread OCR results. The errors remaining in OCRed texts can cause serious problems in reading and understanding if they do not refer to the original image representation. As demonstrated in this paper, a hybrid document which combines symbolic representation and image representation may relieve the problem. If we represent a OCRed document properly in HTML, OCR errors will not have m uch negative eect on the human reading proc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
references
References 1 publication
0
0
0
Order By: Relevance