2016
DOI: 10.14569/ijacsa.2016.070469
|View full text |Cite
|
Sign up to set email alerts
|

Multilingual Artificial Text Extraction and Script Identification from Video Images

Abstract: Abstract-This work presents a system for extraction and script identification of multilingual artificial text appearing in video images. As opposed to most of the existing text extraction systems which target textual occurrences in a particular script or language, we have proposed a generic multilingual text extraction system that relies on a combination of unsupervised and supervised techniques. The unsupervised approach is based on application of image analysis techniques which exploit the contrast, alignmen… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
7
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
1
1

Relationship

1
5

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 51 publications
(63 reference statements)
0
7
0
Order By: Relevance
“…Script recognition has been studied by researchers for text in video images as well as printed and handwritten documents [50,51]. Recognition of script in video text is naturally much more challenging as opposed to printed or handwritten documents due to low resolution of text and in some cases complex backgrounds [52,53]. From simple methods based on template matching [54] to sophisticated structural [55] and statistical [56] features, a number of techniques have been reported in the literature.…”
Section: Script Recognitionmentioning
confidence: 99%
“…Script recognition has been studied by researchers for text in video images as well as printed and handwritten documents [50,51]. Recognition of script in video text is naturally much more challenging as opposed to printed or handwritten documents due to low resolution of text and in some cases complex backgrounds [52,53]. From simple methods based on template matching [54] to sophisticated structural [55] and statistical [56] features, a number of techniques have been reported in the literature.…”
Section: Script Recognitionmentioning
confidence: 99%
“…Much of the current research on Urdu recognition is performed on the cleaned and segmented artificially generated Urdu Nastaliq text such as Urdu Printed Text Images (UPTI) [24], custom extracted [15], generated text with clear background [25], video tickers [26] or handwritten Urdu text [27] as opposed to extracting from outdoor or real-world images with complex background. This work is a step in that direction that integrates synthetic Urdu-text in natural outdoor images.…”
Section: Introductionmentioning
confidence: 99%
“…While for recognition of Urdu characters from outdoor images there are few custom datasets [11], [15], [25] and for recognition of printed characters words there is a famous dataset UPTI [24], which recently has been updated and has been presented with name UPTI2.0 [38] because the performance on UPTI has reached near saturation [33], [35]. There also exist CLE-18000 [32], [39] which contains near 18K ligatures (compound characters).…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations