Abstract:Segmentation of curled textlines from warped document images is one of the major issues in document image dewarping. Most of the curled textlines segmentation algorithms present in the literature today are sensitive to the degree of curl, direction of curl, and spacing between adjacent lines. We present a new algorithm for curled textline segmentation which is robust to above mentioned problems at the expense of high execution time. We will demonstrate this insensitivity in a performance evaluation section. Ou… Show more
“…This section introduces the modified snakes model for estimating the information of x-line and baseline pairs from detected textlines. Our baby-snake [7] and snakelet [8] models are also based on snakes for curled textline detection, but from binarized document image. The modified snake model presented here is as an extension of [7,8].…”
“…Our baby-snake [7] and snakelet [8] models are also based on snakes for curled textline detection, but from binarized document image. The modified snake model presented here is as an extension of [7,8]. The features of modified snakes model for estimating x-line and baseline pairs are described below:…”
“…Previous approaches of curled textline detection [1,2,3,4,5,6,7,8] work on binarized images. These approaches can be divided into two categories: (a) heuristic search [1,2,3,4,5,6] and (b) active contours (snakes) [7,8].…”
Section: Introductionmentioning
confidence: 99%
“…Most of these approaches use rule-based criteria for textline searching. Our active contours (snakes) based baby-snakes [7] model introduced the use of many small open curved snakes and snakelets [8] model introduced many small growing coupled snakes pairs for curled textlines detection from binarized images.…”
Cameras offer flexible document imaging, but with uneven shading and non-planar page shape. Therefore cameracaptured documents need to go through dewarping before being processed by traditional text recognition methods. Curled textline detection is an important step of dewarping. Previous approaches of curled textline detection use binarization as a pre-processing step, which can negatively affect the detection results under uneven shading. Furthermore, these approaches are sensitive to high degrees of curl and estimate x-line 1 and baseline pairs using regression which may result in inaccurate estimation. We introduce a novel curled textline detection approach for grayscale document images. First, the textline structure is enhanced by using match filter bank smoothing and then central lines of textlines are detected using ridges. Then, x-line and baseline pairs are estimated by adapting active contours (snakes) over ridges. Unlike other approaches, our approach does not use binarization and applies directly on grayscale images. We achieved 91% of detection accuracy with good estimation of x-line and baseline pairs on the dataset of CBDAR 2007 document image dewarping contest.
“…This section introduces the modified snakes model for estimating the information of x-line and baseline pairs from detected textlines. Our baby-snake [7] and snakelet [8] models are also based on snakes for curled textline detection, but from binarized document image. The modified snake model presented here is as an extension of [7,8].…”
“…Our baby-snake [7] and snakelet [8] models are also based on snakes for curled textline detection, but from binarized document image. The modified snake model presented here is as an extension of [7,8]. The features of modified snakes model for estimating x-line and baseline pairs are described below:…”
“…Previous approaches of curled textline detection [1,2,3,4,5,6,7,8] work on binarized images. These approaches can be divided into two categories: (a) heuristic search [1,2,3,4,5,6] and (b) active contours (snakes) [7,8].…”
Section: Introductionmentioning
confidence: 99%
“…Most of these approaches use rule-based criteria for textline searching. Our active contours (snakes) based baby-snakes [7] model introduced the use of many small open curved snakes and snakelets [8] model introduced many small growing coupled snakes pairs for curled textlines detection from binarized images.…”
Cameras offer flexible document imaging, but with uneven shading and non-planar page shape. Therefore cameracaptured documents need to go through dewarping before being processed by traditional text recognition methods. Curled textline detection is an important step of dewarping. Previous approaches of curled textline detection use binarization as a pre-processing step, which can negatively affect the detection results under uneven shading. Furthermore, these approaches are sensitive to high degrees of curl and estimate x-line 1 and baseline pairs using regression which may result in inaccurate estimation. We introduce a novel curled textline detection approach for grayscale document images. First, the textline structure is enhanced by using match filter bank smoothing and then central lines of textlines are detected using ridges. Then, x-line and baseline pairs are estimated by adapting active contours (snakes) over ridges. Unlike other approaches, our approach does not use binarization and applies directly on grayscale images. We achieved 91% of detection accuracy with good estimation of x-line and baseline pairs on the dataset of CBDAR 2007 document image dewarping contest.
“…[Nikolaou et al 2010] applies a modified RLSA by adding CCs, white spaces, punctuation marks, and skeleton of the strokes knowledge to the smearing process. In [Bukhari et al 2008[Bukhari et al , 2009] the authors tackle the text segmentation problem by using active contours (snakes) as a base unit to minimize the energy function. The final lines are extracted by joining neighboring snakes after applying several deformations until they stick together.…”
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.