2020
DOI: 10.1109/access.2020.3002808
|View full text |Cite
|
Sign up to set email alerts
|

Cluttered TextSpotter: An End-to-End Trainable Light-Weight Scene Text Spotter for Cluttered Environment

Abstract: Scene text spotting aims at simultaneously localizing and recognizing text instances, symbols, and logos in natural scene images. Scene text detection and recognition approaches have received immense attention in computer vision research community. The presence of partial occlusion or truncation artifact due to the cluttered background of scene images creates an obstacle in perceiving the text instances, which makes the process of spotting very complex. In this paper, we propose a lightweight scene text spotte… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
7
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 13 publications
(7 citation statements)
references
References 88 publications
(129 reference statements)
0
7
0
Order By: Relevance
“…This approach largely depends on the results of the character detection step and therefore, accumulated errors are the major concern in this approach. In word‐based text recognition methods, each word is considered as a whole and holistic word classification is commonly performed to achieve word recognition (Bagi et al, 2020a). A dictionary of segmented words may further need to be considered in this approach.…”
Section: Spotting ‐Based Mining Approachesmentioning
confidence: 99%
See 4 more Smart Citations
“…This approach largely depends on the results of the character detection step and therefore, accumulated errors are the major concern in this approach. In word‐based text recognition methods, each word is considered as a whole and holistic word classification is commonly performed to achieve word recognition (Bagi et al, 2020a). A dictionary of segmented words may further need to be considered in this approach.…”
Section: Spotting ‐Based Mining Approachesmentioning
confidence: 99%
“…A special backbone for text features and two different types of attention have been considered to achieve state‐of‐the‐art performance for both text spotting and script identification in natural images (Y. Zhou, Fang, et al, 2019). Recently, Bagi et al (2020a) have proposed an end‐to‐end trainable deep neural network based on local, global, and contextual information of multiscale feature maps of a lightweight backbone network for text spotting instances in scene images with background clutters, partially occluded text, truncation artifacts, and perspective distortions. The problem of inter‐class misclassification has been addressed by maximizing inter‐class separability and compacting intra‐class variability using Gaussian softmax.…”
Section: Spotting ‐Based Mining Approachesmentioning
confidence: 99%
See 3 more Smart Citations