2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022
DOI: 10.1109/cvpr52688.2022.00582
|View full text |Cite
|
Sign up to set email alerts
|

A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
16
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 54 publications
(16 citation statements)
references
References 32 publications
0
16
0
Order By: Relevance
“…[32] proposes a content perceptual loss to extract multi-scale text recognition features to conduct a content aware supervision. TPGSR [12], TATT [13], and C3-STISR [14] extract text-specific clues to guide the superresolution. In particular, TPGSR is the first method that additionally introduces a scene text recognizer to provide text priors.…”
Section: B Scene Text Image Super-resolutionmentioning
confidence: 99%
See 2 more Smart Citations
“…[32] proposes a content perceptual loss to extract multi-scale text recognition features to conduct a content aware supervision. TPGSR [12], TATT [13], and C3-STISR [14] extract text-specific clues to guide the superresolution. In particular, TPGSR is the first method that additionally introduces a scene text recognizer to provide text priors.…”
Section: B Scene Text Image Super-resolutionmentioning
confidence: 99%
“…Then, the extracted priors are fed into the superresolution to iteratively benefit the super-resolution. TATT [13] introduces a transformer-based module, which leverages global attention mechanism, to exert the semantic guidance of text prior to the text reconstruction process. C3-STISR [14] is proposed to learn triple clues, including recognition clue from a STR, linguistical clue from a language model, and a visual clue from a skeleton painter to rich the representation of the text-specific clue.…”
Section: B Scene Text Image Super-resolutionmentioning
confidence: 99%
See 1 more Smart Citation
“…The research on scene text image super-resolution focuses on the recognizability of text images, and due to the variability and complexity of the text presentation form, there are still many problems and challenges in the current research. The Sequential Residual Block of some network models (Ma et al, 2022;Wang et al, 2020) consist of recurrent neural networks, which mainly use position-based operations and cannot effectively capture the long-distance correlation in text images, and performance degradation occurs when dealing with long text (Qin et al, 2022). Some scene text images have problems such as image occlusion and content missing, so using binarized masks as image inputs could not complement the semantic information of text images, resulting in poor processing results of such images.…”
Section: Introductionmentioning
confidence: 99%
“…In recent years, many prior-guided deep learning-based approaches have attracted much attention in the field of STISR, which can be roughly divided into three major categories: that is, semantic priorguided methods [1][2][3], structure prior-guided methods [5,9,10], and prior-hybrid methods [6].…”
mentioning
confidence: 99%