2020
DOI: 10.1007/978-3-030-57058-3_10
|View full text |Cite
|
Sign up to set email alerts
|

Dewarping Document Image by Displacement Flow Estimation with Fully Convolutional Network

Abstract: As camera-based documents are increasingly used, the rectification of distorted document images becomes a need to improve the recognition performance. In this paper, we propose a novel framework for both rectifying distorted document image and removing background finely, by estimating pixel-wise displacements using a fully convolutional network (FCN). The document image is rectified by transformation according to the displacements of pixels. The FCN is trained by regressing displacements of synthesized distort… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

2
41
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 31 publications
(43 citation statements)
references
References 25 publications
2
41
0
Order By: Relevance
“…Then, we use image similarity and OCR accuracy to evaluate the performance of illumination correction. To be specific, for pixel alignment, we use Local Distortion (LD) [43] as recommended in [7,22,41] to evaluate the geometric distortion of rectified images. For image similarity, we use Multi-Scale Structural SIMilarity (MS-SSIM) [39] as previous works [7,22,41] suggest.…”
Section: Experiments 51 Evaluation Metricsmentioning
confidence: 99%
See 1 more Smart Citation
“…Then, we use image similarity and OCR accuracy to evaluate the performance of illumination correction. To be specific, for pixel alignment, we use Local Distortion (LD) [43] as recommended in [7,22,41] to evaluate the geometric distortion of rectified images. For image similarity, we use Multi-Scale Structural SIMilarity (MS-SSIM) [39] as previous works [7,22,41] suggest.…”
Section: Experiments 51 Evaluation Metricsmentioning
confidence: 99%
“…To be specific, for pixel alignment, we use Local Distortion (LD) [43] as recommended in [7,22,41] to evaluate the geometric distortion of rectified images. For image similarity, we use Multi-Scale Structural SIMilarity (MS-SSIM) [39] as previous works [7,22,41] suggest. For OCR, following [7,22], we choose Edit Distance (ED) [17] and Character Error Rate (CER) to evaluate the capacity on text recognition.…”
Section: Experiments 51 Evaluation Metricsmentioning
confidence: 99%
“…However, the estimation and subsequent stitching of the warping flow patches heavily increase the computational cost. More recently, based on Fully Convolutional Network [49], Xie et al [16] perform a foreground/background classification as a post-processing to refine the predicted forward warping flow on boundary regions of the document.…”
Section: Related Workmentioning
confidence: 99%
“…Recently, deep learning has been introduced to document image rectification with promising performance as well as a significant reduction in computational cost. In deep learning based methods [13], [14], [15], [16], [17], [18], [19], document image rectification is approached by directly regressing a dense 2D vector field (warping flow) that samples the pixels from the distorted images to the rectified ones. However, these methods still suffer from two non-trivial issues.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation