Digital Image Computing: Techniques and Applications (DICTA'05) 2005
DOI: 10.1109/dicta.2005.3

A Front-End OCR for Omni-Font Persian/Arabic Cursive Printed Documents

Cited by 21 publications
(5 citation statements)
References 15 publications
“…As is seen, the values of the horizontal coordinates are from 1 to 42. The numbers in the range of 1 to 6 in the horizontal coordinates are symbols for the connected components with sizes 1 × 1, 1 × 2, 1 × 3, 1 × 4, 1 × 5, and 1 × 6; and the numbers in the range of 7 to 12 are symbols for the connected components with sizes 2 × 1, 2 × 2, 2 × 3, 2 × 4, 2 × 5, and 2 × 6, respectively.…”
Section: Font Recognition and Keyword Modification (mentioning)
confidence: 99%
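
The index scheme described in the statement above can be sketched in a few lines of Python. This is only an illustration, not code from the citing paper: the function names are hypothetical, and the sketch assumes that the pattern shown for indices 1 to 12 continues unchanged up to 42, which would make the last index correspond to size 7 × 6. The statement does not say which factor is height and which is width, so the code simply calls them the first and second factor.

```python
# Hypothetical helpers illustrating the quoted index-to-size scheme.
# Assumption: the pattern for indices 1..12 continues up to 42 (7 groups of 6).

GROUP_SIZE = 6   # six sizes per group: a x 1 ... a x 6 (from the quote)
MAX_INDEX = 42   # largest horizontal coordinate (from the quote)


def size_for_index(index):
    """Return (first, second) such that the component size is first x second."""
    if not 1 <= index <= MAX_INDEX:
        raise ValueError(f"index must be in 1..{MAX_INDEX}, got {index}")
    # Shift to 0-based, split into group number and position within the group.
    group, position = divmod(index - 1, GROUP_SIZE)
    return group + 1, position + 1


def index_for_size(first, second):
    """Inverse mapping: return the 1-based index for a first x second component."""
    index = (first - 1) * GROUP_SIZE + second
    if not (1 <= second <= GROUP_SIZE and 1 <= index <= MAX_INDEX):
        raise ValueError(f"size {first} x {second} is outside the assumed range")
    return index


if __name__ == "__main__":
    for i in (1, 6, 7, 12):
        first, second = size_for_index(i)
        print(f"index {i} -> {first} x {second}")
```

Under these assumptions, indices 1, 6, 7, and 12 map to 1 × 1, 1 × 6, 2 × 1, and 2 × 6, matching the sizes listed in the quoted statement.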
“…In keyword spotting methods, searching is done in the image domain without converting to text. Although there have been great attempts in producing OCR systems for the Farsi/Arabic language, such as those in [1,2], the overall performances of such systems are far from perfect.…”
Section: Introduction (mentioning)
confidence: 99%
“…The most important of these disadvantages are that converting huge amounts of documents is costly and that it is not sufficiently successful on low-quality texts and documents with complicated layouts. Additionally, there is no robust OCR method available yet for Farsi language scripts [2,3]. In order to overcome these problems, researchers suggested another method for document image retrieval that is called keyword spotting or, more simply, word spotting [4].…”
Section: Introduction (mentioning)
confidence: 99%
“…The locative layout structure of a word image and the classification of the components of its layout carry useful information for recognizing Farsi words. According to the literature review above and considering the method discussed in [2,3], in this paper, we propose a new model for machine-printed Farsi text retrieval based on the similarities in the layout of components in Farsi words. The new method is actually the implementation of the method proposed in [28,29].…”
Section: Introduction (mentioning)
confidence: 99%
“…Most retrieval and recognition methods fall into two categories [1]. Methods in the first category retrieve and recognize document images based on a description of the global shape of words or sub-words. In these methods, the descriptors are extracted directly from the image of the word or sub-word [1,3,4,9,10,15]. Methods in the second category segment a word into its letters and then extract features from the image of each letter.…”
Section: Introduction (mentioning)
confidence: 99%