Automatic script identification in the wild

Shi, Baoguang; Yao, Cong; Zhang, Chengquan; Guo, Xiaowei; Huang, Feiyue; Bai, Xiang

doi:10.1109/icdar.2015.7333818

Cited by 46 publications

(30 citation statements)

References 25 publications

(29 reference statements)

“…A much more recent approach to scene text script identification is provided by Shi et al [4] where the authors propose the Multi-stage Spatially-sensitive Pooling Network (MSPN). The MSPN network overcomes the limitation of having a fixed size input in traditional Convolutional Neural Networks by pooling along each row of the intermediate layers' outputs by taking the maximum (or average) value in each row.…”
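The row-wise pooling described in this statement can be illustrated with a minimal NumPy sketch. It is not the MSPN implementation: the function name `row_pool`, the `(channels, height, width)` layout, and the toy shapes are assumptions made for illustration only. It shows why pooling over the width axis gives an output whose size no longer depends on the width of the input image.

```python
import numpy as np


def row_pool(feature_map: np.ndarray, mode: str = "max") -> np.ndarray:
    """Pool a (channels, height, width) feature map along each row.

    Every row of every channel collapses to one value, so the result has
    shape (channels, height) whatever the input width was.
    Hypothetical helper for illustration; not code from the cited paper.
    """
    if feature_map.ndim != 3:
        raise ValueError("expected an array of shape (channels, height, width)")
    if mode == "max":
        return feature_map.max(axis=2)
    if mode == "avg":
        return feature_map.mean(axis=2)
    raise ValueError("mode must be 'max' or 'avg'")


# Two feature maps with different widths, as produced by word images of
# different lengths, pool to the same fixed-size representation.
narrow = np.random.default_rng(0).random((4, 8, 20))
wide = np.random.default_rng(1).random((4, 8, 75))
assert row_pool(narrow).shape == row_pool(wide).shape == (4, 8)
```

Pooling only along the horizontal axis keeps one value per row, so the vertical arrangement of responses is preserved while the variable text length is absorbed.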

Section: Related Work (mentioning)

confidence: 99%

“…provided by the user, or inferred from available meta-data. The unconstrained text understanding problem for large collections of images from unknown sources has not been considered up to very recently [4]. While there exists some research in script identification of text over complex backgrounds [5], [6], such methods have been so far limited to video overlaid-text, which presents in general different challenges than scene text.…”

Section: Introduction (mentioning)

confidence: 99%

A Fine-Grained Approach to Scene Text Script Identification

Gómez

Karatzas

2016

2016 12th IAPR Workshop on Document Analysis Systems (DAS)

Abstract: This paper focuses on the problem of script identification in unconstrained scenarios. Script identification is an important prerequisite to recognition, and an indispensable condition for automatic text understanding systems designed for multi-language environments. Although widely studied for document images and handwritten documents, it remains an almost unexplored territory for scene text images. We detail a novel method for script identification in natural images that combines convolutional features and the Naive-Bayes Nearest Neighbor classifier. The proposed framework efficiently exploits the discriminative power of small stroke-parts, in a fine-grained classification framework. In addition, we propose a new public benchmark dataset for the evaluation of joint text detection and script identification in natural scenes. Experiments done in this new dataset demonstrate that the proposed method yields state-of-the-art results, while it generalizes well to different datasets and variable number of scripts. The evidence provided shows that multi-lingual scene text recognition in the wild is a viable proposition. Source code of the proposed method is made available online.
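For readers unfamiliar with the Naive-Bayes Nearest Neighbor classifier named in this abstract, the following is a minimal sketch of the general NBNN decision rule. It is not the authors' released code: the function name `nbnn_classify`, the brute-force distance computation, and the two-dimensional toy descriptors (standing in for the convolutional features of stroke-parts) are all assumptions for illustration.

```python
import numpy as np


def nbnn_classify(query_descriptors: np.ndarray, class_descriptors: dict) -> str:
    """Return the label whose descriptor pool best explains the query.

    query_descriptors: (n, d) array of local descriptors from one image.
    class_descriptors: maps each label to an (m, d) array pooling the
    descriptors of all training images of that class.
    Hypothetical helper for illustration; not code from the cited paper.
    """
    totals = {}
    for label, pool in class_descriptors.items():
        # Squared Euclidean distance from every query descriptor to every
        # descriptor in this class pool, shape (n, m).
        diffs = query_descriptors[:, None, :] - pool[None, :, :]
        sq_dists = (diffs ** 2).sum(axis=2)
        # Each query descriptor contributes its nearest-neighbor distance.
        totals[label] = sq_dists.min(axis=1).sum()
    # The predicted class minimizes the summed nearest-neighbor distance.
    return min(totals, key=totals.get)


# Toy pools: class "a" clusters near the origin, class "b" near (5, 5).
pools = {"a": np.zeros((3, 2)), "b": np.full((3, 2), 5.0)}
query = np.array([[4.5, 5.2], [5.1, 4.9]])
assert nbnn_classify(query, pools) == "b"
```

Because the rule compares each local descriptor to a class-wide pool instead of to whole training images, a few distinctive parts can decide the label, which fits the fine-grained setting the abstract describes.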

“…Most of these datasets are used for scene text localization and recognition in English. There are also few datasets [8,1] of multiple scripts e.g., east Asian languages or Indian language video text. In this work we introduce Indian Language Scene Text (ILST) dataset which is a comprehensive dataset for Indian language scene text containing six scripts commonly used in India, namely Telugu, Tamil, Malayalam, Kannada, Hindi and English.…”

Section: A. The ILST Dataset (mentioning)

confidence: 99%

“…There are many methods in the literature for script identification [1,2,5,6,7,8]. Texture based features such as Gabor filter [7], LBP [9] have been used for script identification.…”

Section: Introduction (mentioning)

confidence: 99%

A Simple and Effective Solution for Script Identification in the Wild

Singh

Mishra

Dabral

et al. 2016

2016 12th IAPR Workshop on Document Analysis Systems (DAS)

We present an approach for automatically identifying the script of the text localized in the scene images. Our approach is inspired by the advancements in mid-level features. We represent the text images using mid-level features which are pooled from densely computed local features. Once text images are represented using the proposed mid-level feature representation, we use an off-the-shelf classifier to identify the script of the text image. Our approach is efficient and requires very less labeled data. We evaluate the performance of our method on a recently introduced CVSI dataset, demonstrating that the proposed approach can correctly identify script of 96.70% of the text images. In addition, we also introduce and benchmark a more challenging Indian Language Scene Text (ILST) dataset for evaluating the performance of our method.

Document Language Classification: Hierarchical Model with Deep Learning Approach

Shah

Joshi

2021

Computer Analysis of Images and Patterns
