2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.00499
|View full text |Cite
|
Sign up to set email alerts
|

Blind Geometric Distortion Correction on Images Through Deep Learning

Abstract: We propose the first general framework to automatically correct different types of geometric distortion in a single input image. Our proposed method employs convolutional neural networks (CNNs) trained by using a large synthetic distortion dataset to predict the displacement field between distorted images and corrected images. A model fitting method uses the CNN output to estimate the distortion parameters, achieving a more accurate prediction. The final corrected image is generated based on the predicted flow… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
79
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 77 publications
(79 citation statements)
references
References 43 publications
(61 reference statements)
0
79
0
Order By: Relevance
“…Blind distortion correction is an ill-posed problem. Therefore, learning-based methods using only a single distorted image are being pursued [ 10 , 11 , 12 , 42 , 43 , 44 , 45 , 46 ]. Deep learning for correcting documents were proposed recently [ 12 , 44 , 45 , 46 ] which implements convolutional neural networks, encoder-decoders, and U-net-based architectures [ 47 ].…”
Section: Related Workmentioning
confidence: 99%
See 4 more Smart Citations
“…Blind distortion correction is an ill-posed problem. Therefore, learning-based methods using only a single distorted image are being pursued [ 10 , 11 , 12 , 42 , 43 , 44 , 45 , 46 ]. Deep learning for correcting documents were proposed recently [ 12 , 44 , 45 , 46 ] which implements convolutional neural networks, encoder-decoders, and U-net-based architectures [ 47 ].…”
Section: Related Workmentioning
confidence: 99%
“…Work on correcting portrait images used an encoder-decoder architecture [ 10 ]. The encoder-decoder architecture proposed by Li et al [ 11 ] aims to correct real-world images by predicting the distortion flow and further refining the correction by iterative resampling, which is a predecessor of our work. Instead of using a multi-model network for predicting the distortion flow, we used multiple convolutional neural networks (CNN) that run in parallel to predict the transformation matrix.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations