2019
DOI: 10.1016/j.patcog.2018.08.011
|View full text |Cite
|
Sign up to set email alerts
|

A selectional auto-encoder approach for document image binarization

Abstract: Binarization plays a key role in the automatic information retrieval from document images. This process is usually performed in the first stages of documents analysis systems, and serves as a basis for subsequent steps. Hence it has to be robust in order to allow the full analysis workflow to be successful. Several methods for document image binarization have been proposed so far, most of which are based on hand-crafted image processing strategies. Recently, Convolutional Neural Networks have shown an amazing … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
122
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 139 publications
(122 citation statements)
references
References 34 publications
0
122
0
Order By: Relevance
“…In summary, the differences of the proposed method with the existed works [16,20,14,21] are summarized as follows. (1) Unlike the previous methods which train the neural network to learn the labels of each pixel, the output of our method is the latent uniform version of the input images, which represents an internally enhanced version of the image.…”
Section: Introductionmentioning
confidence: 99%
“…In summary, the differences of the proposed method with the existed works [16,20,14,21] are summarized as follows. (1) Unlike the previous methods which train the neural network to learn the labels of each pixel, the output of our method is the latent uniform version of the input images, which represents an internally enhanced version of the image.…”
Section: Introductionmentioning
confidence: 99%
“…In the work of Vo et al [17], a Markov Random Field is used to binarize the documents from a foreground modeling based on the color of the staff lines. However, the main problem found in these options is the varying performance depending on the characteristics of the document [18,19].…”
Section: Introductionmentioning
confidence: 99%
“…In Table 3 we show the measurements of previously trained network for the H-DIBCO'18 dataset. We have to notice that it outperformed all participants of the H-DIBCO'18 on the target dataset [37]. Moreover, the organizers also have published results of proposed methods obtained for DIBCO'17 dataset in [37] in Table II.…”
Section: Resultsmentioning
confidence: 91%
“…We have to notice that it outperformed all participants of the H-DIBCO'18 on the target dataset [37]. Moreover, the organizers also have published results of proposed methods obtained for DIBCO'17 dataset in [37] in Table II. The situation here is the same: no new method was good enough to improve results of the 2017 year.…”
Section: Resultsmentioning
confidence: 91%