2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022
DOI: 10.1109/cvpr52688.2022.00455
|View full text |Cite
|
Sign up to set email alerts
|

SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
32
0
1

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 85 publications
(33 citation statements)
references
References 59 publications
0
32
0
1
Order By: Relevance
“…Zhong et al [9] added a balance module to the decoder, which can reduce the impact of noise on characters. Huang et al [10] used the results of text detection to correct the loss of text recognition and achieved better recognition performance. He et al [11] employed the segmentation network to segment the characters, generated the corresponding graph node network according to the segmentation graph, and finally utilized the graph neural network for recognition.…”
Section: Deep Learning-based Text Recognition Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…Zhong et al [9] added a balance module to the decoder, which can reduce the impact of noise on characters. Huang et al [10] used the results of text detection to correct the loss of text recognition and achieved better recognition performance. He et al [11] employed the segmentation network to segment the characters, generated the corresponding graph node network according to the segmentation graph, and finally utilized the graph neural network for recognition.…”
Section: Deep Learning-based Text Recognition Methodsmentioning
confidence: 99%
“…where is the sigmoid function, and f 7×7 indicates that the convolutional kernel is a 7 × 7 convolution operation. For convenience, we set the parameters according to references [6] and [10]. Extensive experiments have proved that the parameter settings in references [6] and [10] can obtain satisfactory results in different computer vision tasks.…”
Section: Convolutional Block Attention Modulementioning
confidence: 99%
See 2 more Smart Citations
“…al., 2022) They proposed SwinTextSpotter for text detection and text recognition. The overall architecture of SwinTextSpotter consists of four components: a backbone based on Swin-Transformer; a query-based text detector; a Recognition Conversion module to bridge the text detector and recognizer; and an attention-based recognizer [10]. This method gives a small percentage (66.9%) in determining the texts affiliated with the (Total Text) data set.…”
Section: Related Workmentioning
confidence: 99%