Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318) 1999
DOI: 10.1109/icdar.1999.791879
|View full text |Cite
|
Sign up to set email alerts
|

A document image retrieval method tolerating recognition and segmentation errors of OCR using shape-feature and multiple candidates

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
5
0

Year Published

2003
2003
2019
2019

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(5 citation statements)
references
References 2 publications
0
5
0
Order By: Relevance
“…Motivated by this observation, some retrieval methods with the ability to tolerate the recognition errors of OCR have been researched later (Ohtam et al 1997). Additionally, some methods were reported to improve retrieval performance by using OCR candidates (Kameshiro et al 1999;Katsuyama 2002).…”
Section: Introductionmentioning
confidence: 95%
See 1 more Smart Citation
“…Motivated by this observation, some retrieval methods with the ability to tolerate the recognition errors of OCR have been researched later (Ohtam et al 1997). Additionally, some methods were reported to improve retrieval performance by using OCR candidates (Kameshiro et al 1999;Katsuyama 2002).…”
Section: Introductionmentioning
confidence: 95%
“…There are two primary approaches to locate the desirable text in the document images for retrieving the appropriate information; Optical Character Recognition (OCR) technique (Kameshiro et al 1999) and Document Image Retrieval (Keyword spotting) technique (Doermann 1998). Optical Character Recognition deals with the machine recognition of characters present in an input image obtained using scanning operation (Doermann 1998).…”
Section: Introductionmentioning
confidence: 99%
“…To search for a keyword in document images, first of all, by optical character recognition (OCR), we have to convert the format of document images from pictorial format to text format, which is translatable by the machine [1], and then by the use of the traditional methods of document retrieval, the target word is sought in the text. Although OCR is frequently used by researchers in this area, it has some disadvantages that cause OCR to be inappropriate in all retrieval cases.…”
Section: Introductionmentioning
confidence: 99%
“…For document retrieval from large database, it is necessary to build an index containing multiple candidate recognition results so as to overcome the recognition error. According to the indexing technique, handwritten document retrieval methods can be categorized into two groups: indexing by character recognition (transcription) 4,17,18 and lexicondriven indexing. 1,42 Transcription-based text search relies on the character recognition accuracy.…”
Section: Introductionmentioning
confidence: 99%