2005
DOI: 10.1109/tpami.2005.227
|View full text |Cite
|
Sign up to set email alerts
|

Texture for script identification

Abstract: Abstract-The problem of determining the script and language of a document image has a number of important applications in the field of document analysis, such as indexing and sorting of large collections of such images, or as a precursor to optical character recognition (OCR). In this paper, we investigate the use of texture as a tool for determining the script of a document image, based on the observation that text has a distinct visual texture. An experimental evaluation of a number of commonly used texture … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
83
0
4

Year Published

2009
2009
2019
2019

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 154 publications
(87 citation statements)
references
References 45 publications
0
83
0
4
Order By: Relevance
“…Writer recognition systems can typically make use of global features such as texture, curvature and slant features [10][11][12] as well as a combination of local features such as graphemes, allographs and connected components [1,13,14] Our work falls into the latter category of text-independent techniques where the writers are not bounded by any specific lines of text in order for the system to recognize them. Instead, the system analyzes their handwriting styles through a series of automated processes, regardless of what they have written.…”
Section: Previous Workmentioning
confidence: 99%
“…Writer recognition systems can typically make use of global features such as texture, curvature and slant features [10][11][12] as well as a combination of local features such as graphemes, allographs and connected components [1,13,14] Our work falls into the latter category of text-independent techniques where the writers are not bounded by any specific lines of text in order for the system to recognize them. Instead, the system analyzes their handwriting styles through a series of automated processes, regardless of what they have written.…”
Section: Previous Workmentioning
confidence: 99%
“…In contrast, global approaches (Joshi [4]) employ an analysis of regions (block of text) comprising atleast two lines (or words)without finer segmentation. In general, global approaches work well based on texture measurement, but this relies heavily on a uniform block of text (Buschet al [5]), and extensive preprocessing (to make the text block uniform) is required to measure the texture. Even though local approaches rely on the accuracy of character segmentation or connected component analysis, it could work well on the documents irrespective of their quality or uniformity in the block of text.…”
Section: B Script Recognitionmentioning
confidence: 99%
“…Fourteen textural features extracted of the GLCM have been initially introduced by (Haralick et al, 1973) for texture discrimination of natural and satellite images. A number of co-occurrence feature extraction and analysis methods (Mikkilineni et al, 2005;Lin et al, 2006;Payne et al, 1994;Peake and Tan, 1997;Busch et al, 2005) have been proposed in order to segment and classify the content of document images, and to identify script and language from document images. Briefly, the GLCM matrices are obtained for a small range of distance values d = 1, 2 and typically for the directions θ = {0 • , 45 • , 90 • , 135 • } (Busch et al, 2005).…”
Section: Co-occurrence Featuresmentioning
confidence: 99%
“…Six co-occurrence features are extracted from the GLCM matrices: the maximum entry in the GLCM or the maximum probability, the correlation metric, the energy or the angular second moment, the entropy, the inertia or the contrast, and the local homogeneity for two distances {d = 1, 2} (Mikkilineni et al, 2005;Busch et al, 2005). In addition to the twelve cooccurrence features (six for each distance), two other descriptors are computed: the mean value and the standard deviation of the energy for the two distances combined (Lin et al, 2006).…”
Section: Co-occurrence Featuresmentioning
confidence: 99%
See 1 more Smart Citation