2022
DOI: 10.7717/peerj-cs.1093
|View full text |Cite
|
Sign up to set email alerts
|

S-Swin Transformer: simplified Swin Transformer model for offline handwritten Chinese character recognition

Abstract: The Transformer shows good prospects in computer vision. However, the Swin Transformer model has the disadvantage of a large number of parameters and high computational effort. To effectively solve these problems of the model, a simplified Swin Transformer (S-Swin Transformer) model was proposed in this article for handwritten Chinese character recognition. The model simplifies the initial four hierarchical stages into three hierarchical stages. In addition, the new model increases the size of the window in th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 10 publications
(3 citation statements)
references
References 41 publications
0
1
0
Order By: Relevance
“…Next, the results obtained after the first stage processing are input to the second stage, and feature extraction and exchange are continued, with the number of each token reduced to H/8 × W/8 and the output dimension expanded to 2 C [36]. Similarly, the above operation is repeated next, with the number of tokens in the third and fourth stages being H/16 × W/16 and H/32 × W/32, respectively.…”
Section: Swin Transformer Modelmentioning
confidence: 99%
“…Next, the results obtained after the first stage processing are input to the second stage, and feature extraction and exchange are continued, with the number of each token reduced to H/8 × W/8 and the output dimension expanded to 2 C [36]. Similarly, the above operation is repeated next, with the number of tokens in the third and fourth stages being H/16 × W/16 and H/32 × W/32, respectively.…”
Section: Swin Transformer Modelmentioning
confidence: 99%
“…Before, the man-made reasoning field has achieved a quantum jump. Presently, studies using man-made brainpower are directed and different fields are applied [1][2][3]. Particularly, attributable to the improvement is done in figuring power, in the field of PC vision the areas of use continually become more extensive.…”
Section: Introductionmentioning
confidence: 99%
“…The information is as bitmap picture. Disconnected written by hand acknowledgment [1] is similarly troublesome, as various individuals have different penmanship styles, and at times, the composing style of a similar individual fluctuates at various examples. On the web penmanship acknowledgment framework is the one in which they perceive the text of transcribed, while composing through the touch cushion utilizing pointer pen.…”
Section: Introductionmentioning
confidence: 99%