2018
DOI: 10.48550/arxiv.1812.02475
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Binary Document Image Super Resolution for Improved Readability and OCR Performance

Ram Krishna Pandey,
K Vignesh,
A G Ramakrishnan
et al.

Abstract: There is a need for information retrieval from large collections of low-resolution (LR) binary document images, which can be found in digital libraries across the world, where the high-resolution (HR) counterpart is not available. This gives rise to the problem of binary document image superresolution (BDISR). The objective of this paper is to address the interesting and challenging problem of super resolution of binary Tamil document images for improved readability and better optical character recognition (OC… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

1
15
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
4

Relationship

1
7

Authors

Journals

citations
Cited by 11 publications
(16 citation statements)
references
References 42 publications
1
15
0
Order By: Relevance
“…[32] compared the performance of several artificial filters on down-sampled text images. [36] propose a convolutiontransposed convolution architecture to deal with binary document SR. [9] adapt SRCNN [8] in text image SR in the ICDAR 2015 competition TextSR [37] and achieved a good performance, but no text-oriented method was proposed.…”
Section: Related Workmentioning
confidence: 99%
“…[32] compared the performance of several artificial filters on down-sampled text images. [36] propose a convolutiontransposed convolution architecture to deal with binary document SR. [9] adapt SRCNN [8] in text image SR in the ICDAR 2015 competition TextSR [37] and achieved a good performance, but no text-oriented method was proposed.…”
Section: Related Workmentioning
confidence: 99%
“…The previous super-resolution methods usually try to learn degradation patterns through HR-LR pairs with global loss functions (e.g., L1 or L2 loss) to recover every pixel in text images (Xu et al 2017;Pandey et al 2018). These methods, however, usually view text images as general images regardless of text-specific properties.…”
Section: Introductionmentioning
confidence: 99%
“…A Lat et al [24] improved OCR accuracy by super-resolving document images using SRGAN [19]. Another closely related work, [25] uses deep neural network architectures for Binary Document Image Super-Resolution (BDISR) in Tamil script. These works are only restricted to document images, whereas our method solves a broader problem of text image SR while improving OCR confidence.…”
Section: Related Workmentioning
confidence: 99%