2020
DOI: 10.48550/arxiv.2009.10874
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Hamming OCR: A Locality Sensitive Hashing Neural Network for Scene Text Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 29 publications
0
2
0
Order By: Relevance
“…Taking into account the transformer-based methods have the advantages of parallel computation in the field of natural language processing, Zhu et al [15] directly combined heavy backbone and the whole transformer network as a new text recognizer, and a hierarchical attention mechanism including four self-attention blocks was proposed in the encoding part to describe context information, which is also a heavy model. To make text recognition technology more suitable for real-world applications, Li et al [26] proposed a transformer-based model, a self-attention mechanism was applied to text recognition to reach a better ideal state. This model utilized localitysensitive hashing instead of softmax regression to compress model.…”
Section: Related Workmentioning
confidence: 99%
“…Taking into account the transformer-based methods have the advantages of parallel computation in the field of natural language processing, Zhu et al [15] directly combined heavy backbone and the whole transformer network as a new text recognizer, and a hierarchical attention mechanism including four self-attention blocks was proposed in the encoding part to describe context information, which is also a heavy model. To make text recognition technology more suitable for real-world applications, Li et al [26] proposed a transformer-based model, a self-attention mechanism was applied to text recognition to reach a better ideal state. This model utilized localitysensitive hashing instead of softmax regression to compress model.…”
Section: Related Workmentioning
confidence: 99%
“…Moreover, Hamming optical character recognition (OCR) was proposed on the basis of the transformer structure, which solves the problem of too many categories leading to too many model parameters. (11) In recent years, most of the research on text recognition has been carried out simultaneously with text detection, and end-to-end detection recognition models such as fast oriented text spotting (FOTS) and (Towards Efficient and Accurate End-to-End Spotting of Arbitrarily-Shaped Text) PAN++ have been proposed, both of which have achieved good results. (12,13) Named entity recognition (NER) methods aim at recognizing entities of interest in text such as location, organization, and time.…”
Section: Introductionmentioning
confidence: 99%