2019 International Conference on Document Analysis and Recognition (ICDAR) 2019
DOI: 10.1109/icdar.2019.00095
|View full text |Cite
|
Sign up to set email alerts
|

Curved Text Detection in Natural Scene Images with Semi- and Weakly-Supervised Learning

Abstract: Detecting curved text in the wild is very challenging. Recently, most state-of-the-art methods are segmentation based and require pixel-level annotations. We propose a novel scheme to train an accurate text detector using only a small amount of pixel-level annotated data and a large amount of data annotated with rectangles or even unlabeled data. A baseline model is first obtained by training with the pixellevel annotated data and then used to annotate unlabeled or weakly labeled data. A novel strategy which u… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
12
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
4
4
1

Relationship

5
4

Authors

Journals

citations
Cited by 38 publications
(12 citation statements)
references
References 32 publications
0
12
0
Order By: Relevance
“…Different convolutional deep learning neural network based methods have recently been used as feature backbone to extract features in order to appropriately handle the text of different scales (Gao et al, 2019; S. Qin, Bissacco, et al, 2019). Features have been extracted by using the output of one or more of the hidden layers in CNN (Gao et al, 2019; X. Qin, Zhou, et al, 2019). Sharing features extracted from CNN has also been used to extend a character classification method to character detection and bigram classification.…”
Section: Spotting ‐Based Mining Approachesmentioning
confidence: 99%
“…Different convolutional deep learning neural network based methods have recently been used as feature backbone to extract features in order to appropriately handle the text of different scales (Gao et al, 2019; S. Qin, Bissacco, et al, 2019). Features have been extracted by using the output of one or more of the hidden layers in CNN (Gao et al, 2019; X. Qin, Zhou, et al, 2019). Sharing features extracted from CNN has also been used to extend a character classification method to character detection and bigram classification.…”
Section: Spotting ‐Based Mining Approachesmentioning
confidence: 99%
“…In regression-based methods, geometry of text is directly predicted from convolutional features [2, 11, 12, 17, 19, 22, 23, 42, 50-52, 56, 77] or RoI features [25,37,55,72], and then used to decode to produce the predicted results based on given reference points or boxes. In instance segmentation based methods, typically, Mask R-CNN based methods [28,33,43,57,59,60], an extra branch is added to a detection framework. The results are achieved via instance segmentation, getting rids of learning target confusion problem [26,61] which exists in regression-based methods.…”
Section: Related Workmentioning
confidence: 99%
“…Mask TextSpotter [11] is the first end-to-end trainable arbitraryshaped scene text spotter with a detection module based on Mask R-CNN. Qin et al [15] reduce the requirement of pixel-level annotations with weakly-supervised learning. Chen et al [2] propose a self-training framework with unannotated videos based on Mask R-CNN.…”
Section: Related Workmentioning
confidence: 99%