2016
DOI: 10.1117/1.jei.25.3.033014
|View full text |Cite
|
Sign up to set email alerts
|

Robust skew estimation using straight lines in document images

Abstract: A skew-estimation method using straight lines in document images is presented. Unlike conventional approaches exploiting the properties of text, we formulate the skew-estimation problem as an estimation task using straight lines in images and focus on robust and accurate line detection. To be precise, we adopt a blockbased edge detector followed by a progressive line detector to take clues from a variety of sources such as text lines, boundaries of figures/tables, vertical/horizontal separators, and boundaries… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 25 publications
0
3
0
Order By: Relevance
“…We will continue to extend our investigation to develop more suitable CNN models for document classification tasks. This would involve exploring more CNN architectures such as inception 12 and DenseNet; 77 more document image preprocessing techniques such as Zemouri and Chibani's 78 binarization for degraded document images, Koo and Cho's skewness estimation, 79 and image augmentation 80 to increase the amount of "groundtruth" training data; and more document image databases such as a medieval document image collection. 81 Second, our investigations revealed impact trends of coupling preprocessing the CNN's performance and demonstrated that the impact of coupling preprocessing could stem from different factors.…”
Section: Discussionmentioning
confidence: 99%
“…We will continue to extend our investigation to develop more suitable CNN models for document classification tasks. This would involve exploring more CNN architectures such as inception 12 and DenseNet; 77 more document image preprocessing techniques such as Zemouri and Chibani's 78 binarization for degraded document images, Koo and Cho's skewness estimation, 79 and image augmentation 80 to increase the amount of "groundtruth" training data; and more document image databases such as a medieval document image collection. 81 Second, our investigations revealed impact trends of coupling preprocessing the CNN's performance and demonstrated that the impact of coupling preprocessing could stem from different factors.…”
Section: Discussionmentioning
confidence: 99%
“…For realistic test images, we have scanned printed documents with flatbed-scanners, and applied the following procedures to the scanned images: skew correction [31], text line extraction [32], and word segmentation [30]. In building the testset, we have used multi-script words by concatenating English, Chinese, and Korean characters as shown in Figure 5.…”
Section: B Test Word Datasetmentioning
confidence: 99%
“…Knowledge about the lines in an image is useful in many applications such as unmanned vehicle guidance, robot navigation, medical image processing, object recognition, computer vision and artificial intelligence [1][2][3][4][5][6][7][8][9][10][11][12]. In a high-resolution image, thousands of lines in different angles are possible.…”
Section: Introductionmentioning
confidence: 99%