2020
DOI: 10.14569/ijacsa.2020.0110424
|View full text |Cite
|
Sign up to set email alerts
|

Deep Neural Networks Combined with STN for Multi-Oriented Text Detection and Recognition

Abstract: Developing systems for interpreting visuals, such as images, videos is really challenging but important task to be developed and applied on benchmark datasets. This study solves the very challenge by using STN-OCR model consisting of deep neural networks (DNN) and Spatial Transformer Networks (STNs). The network architecture of this study consists of two stages: localization network and recognition network. In the localization network it finds and localizes text regions and generates sampling grid. Whereas, in… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 10 publications
(1 citation statement)
references
References 28 publications
0
1
0
Order By: Relevance
“…This approach did not meet the necessary general symmetry points due to the small font, low contrast, and complex background in the video. The Discrete Cosine Transform (DCT) factors of the intensity images have been extensively deployed as the texture elements for text detection [16][17][18].…”
Section: Pattern Recognition -Pattern Recognition Involvesmentioning
confidence: 99%
“…This approach did not meet the necessary general symmetry points due to the small font, low contrast, and complex background in the video. The Discrete Cosine Transform (DCT) factors of the intensity images have been extensively deployed as the texture elements for text detection [16][17][18].…”
Section: Pattern Recognition -Pattern Recognition Involvesmentioning
confidence: 99%