2023
DOI: 10.3390/electronics12061395
|View full text |Cite
|
Sign up to set email alerts
|

Efficient Neural Network for Text Recognition in Natural Scenes Based on End-to-End Multi-Scale Attention Mechanism

Abstract: Text recognition in natural scenes has been a very challenging task in recent years, and rich text semantic information is of great significance for the understanding of a scene. However, text images in natural scenes often contain a lot of noise data, which leads to error detection. The problems of high error detection rate and low recognition accuracy have brought great challenges to the task of text recognition. To solve this problem, we propose a text recognition algorithm based on natural scenes. First, t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 39 publications
0
1
0
Order By: Relevance
“…Among them, TP represents the number of characters correctly recognized by OCR, and FP represents the number of characters incorrectly recognized by OCR [42].…”
Section: Ocr Evaluation Indexmentioning
confidence: 99%
“…Among them, TP represents the number of characters correctly recognized by OCR, and FP represents the number of characters incorrectly recognized by OCR [42].…”
Section: Ocr Evaluation Indexmentioning
confidence: 99%
“…Compared with the hyper-LPR model, the UNET-GWO-SVM model designed in this paper dramatically improves the positioning accuracy and recognition speed. On the other hand, there is still a gap compared with different end-to-end neural network recognition algorithms [41][42][43][44]. However, they consider the applicability of embedded hardware.…”
Section: License Plate Recognition Experimentsmentioning
confidence: 99%
“…Neural networks can still be applied to the realm of text recognition, but necessitate adjustments to network architecture. Numerous backbone networks for object recognition [35,36], including VggNet [37,38], ResNet [39,40], and DenseNet [41,42], are also highly applicable to text recognition, elevating the recognition accuracy of individual characters. Dataset training is an indispensable component of neural network training, but in the course of researching vehicular screen text recognition, we discovered an almost complete absence of mature datasets within this field.…”
Section: Introductionmentioning
confidence: 99%