2023
DOI: 10.48550/arxiv.2302.02073
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

GDB: Gated convolutions-based Document Binarization

Abstract: Document binarization is a key pre-processing step for many document analysis tasks. However, existing methods can not extract stroke edges finely, mainly due to the fairtreatment nature of vanilla convolutions and the extraction of stroke edges without adequate supervision by boundaryrelated information. In this paper, we formulate text extraction as the learning of gating values and propose an endto-end gated convolutions-based network (GDB) to solve the problem of imprecise stroke edge extraction. The gated… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
7
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(7 citation statements)
references
References 44 publications
0
7
0
Order By: Relevance
“…But it can also be clearly seen that the results of document image binarization based on deep learning are far better than those of traditional methods, so the algorithms of document image binarization in the past three years are basically studied on the technology of deep learning. Such as literature [57][58][59][60][61][62][63][66][67][68][69]72,73].…”
Section: Deep Learning Binarization Approachesmentioning
confidence: 99%
See 3 more Smart Citations
“…But it can also be clearly seen that the results of document image binarization based on deep learning are far better than those of traditional methods, so the algorithms of document image binarization in the past three years are basically studied on the technology of deep learning. Such as literature [57][58][59][60][61][62][63][66][67][68][69]72,73].…”
Section: Deep Learning Binarization Approachesmentioning
confidence: 99%
“…The models of document image binarization based on deep learning technology are mainly divided into two research directions. One is the model obtained based on Convolutional Neural Network (CNN) [19,[47][48][49][50][51][52][53][54][55][56][57][58][59][60][61][62][63], and the other is based on Generative Adversarial Networks (GANs) [66][67][68][69].…”
Section: Deep Learning Binarization Approachesmentioning
confidence: 99%
See 2 more Smart Citations
“…But it can also be clearly seen that the results of document image binarization based on deep learning are far better than those of traditional methods, so the algorithms for document image binarization that have been developed over the past three years have generally been based on deep learning technology. Such as literature [57][58][59][60][61][62][63][66][67][68][69][72][73][74], especially the model of Biswas et al [72] has almost obtained the best results on most of the DIBCO datasets. Biswas et al's model is a document binarization encoder-decoder architecture based on a Tokens-to-token vision transformer, employing a progressive tokenization technique to capture the local information from an image to achieve more effective binarization results.…”
Section: Deep Learning Binarization Approachesmentioning
confidence: 99%