2020
DOI: 10.48550/arxiv.2002.01276
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

GTC: Guided Training of CTC Towards Efficient and Accurate Scene Text Recognition

Abstract: Connectionist Temporal Classification (CTC) and attention mechanism are two main approaches used in recent scene text recognition works. Compared with attention-based methods, CTC decoder has a much shorter inference time, yet a lower accuracy. To design an efficient and effective model, we propose the guided training of CTC (GTC), where CTC model learns a better alignment and feature representations from a more powerful attentional guidance. With the benefit of guided training, CTC model achieves robust and a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 24 publications
0
2
0
Order By: Relevance
“…Moreover, inspired by SRN (Yu et al 2020) and GTC (Hu et al 2020), we propose a graph refinement module (GRM) to make secondary reasoning to improve the end-to-end performance further. The points in a text sequence are formulated as nodes in a graph, where the representation of each node is enhanced with semantic context and visual context information from its neighbors, and the character classification result should be more accurate.…”
Section: Introductionmentioning
confidence: 99%
“…Moreover, inspired by SRN (Yu et al 2020) and GTC (Hu et al 2020), we propose a graph refinement module (GRM) to make secondary reasoning to improve the end-to-end performance further. The points in a text sequence are formulated as nodes in a graph, where the representation of each node is enhanced with semantic context and visual context information from its neighbors, and the character classification result should be more accurate.…”
Section: Introductionmentioning
confidence: 99%
“…Recently, the pursuit of high performance on benchmarks has drawn much attention from the community. Driven by deep learning [50,31,2,33,12] and large volume of synthetic data [13,29,46], the recognition accuracy on standard benchmarks has escalated rapidly. For instance, the accuracy on IIIT-5k [27] without lexicon has increased from 78.2% [31] to 96.0% [12] in a very short period.…”
Section: Introductionmentioning
confidence: 99%