2006
DOI: 10.1007/s10032-006-0020-2
|View full text |Cite
|
Sign up to set email alerts
|

A survey of document image classification: problem statement, classifier architecture and performance evaluation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
73
0
3

Year Published

2013
2013
2022
2022

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 145 publications
(76 citation statements)
references
References 43 publications
0
73
0
3
Order By: Relevance
“…It comprises a set of simple techniques and procedures, which are used to work upon the images of documents and exchange them from pixel information into a format that can be read by a computer. Information retrieval from the document images is a challenging task, hence number of techniques and procedures are used for document image processing [3]. Converting a scanned grey scale image into a binary image, the foreground (or regions of interest) and removing the background is an important step in many image analysis systems includes document image processing.…”
Section: Introductionmentioning
confidence: 99%
“…It comprises a set of simple techniques and procedures, which are used to work upon the images of documents and exchange them from pixel information into a format that can be read by a computer. Information retrieval from the document images is a challenging task, hence number of techniques and procedures are used for document image processing [3]. Converting a scanned grey scale image into a binary image, the foreground (or regions of interest) and removing the background is an important step in many image analysis systems includes document image processing.…”
Section: Introductionmentioning
confidence: 99%
“…SIFT [25] including both the detector and descriptor as a whole system provides better results compared to other systems under Mikolajczyk`s evaluation framework [26] but It's known that SIFT suffers from a high computational complexity [27]. PCA-SIFT is one of the most successful extensions [28], which applies Principal Components Analysis (PCA) [29], a standard technique for dimensional reduction, on the target image patch and produces a similar descriptor as the standard SIFT one. Another algorithm called i-SIFT, has been presented based on the PCA-SIFT by Fritz et al [30], which applied Information Theoretic Selection to cut down the number of interest points so as to reduce the cost for computing descriptors.…”
Section: Introductionmentioning
confidence: 99%
“…This article is a contribution to the extensive literature on document classification, which is surveyed by Nawei Chen and Dorothea Blostein (2007), especially efforts (such as ours) that make use of image features for categorizing documents. The types of features we use here are by now standard.…”
Section: Introductionmentioning
confidence: 99%