2021 18th Conference on Robots and Vision (CRV) 2021
DOI: 10.1109/crv52889.2021.00024
|View full text |Cite
|
Sign up to set email alerts
|

2LSPE: 2D Learnable Sinusoidal Positional Encoding using Transformer for Scene Text Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(1 citation statement)
references
References 24 publications
0
1
0
Order By: Relevance
“…In addition, the authors proposed an adaptive 2D positional encoding as well as a locality-aware feed-forward module in the transformer encoder. With a focus on the positional encoding of transformer, Raisi et al [31] applied a 2D learnable sinusoidal positional encoding which enables the CNNtransformer encoder to focus more on spatial dependencies.…”
Section: Introductionmentioning
confidence: 99%
“…In addition, the authors proposed an adaptive 2D positional encoding as well as a locality-aware feed-forward module in the transformer encoder. With a focus on the positional encoding of transformer, Raisi et al [31] applied a 2D learnable sinusoidal positional encoding which enables the CNNtransformer encoder to focus more on spatial dependencies.…”
Section: Introductionmentioning
confidence: 99%