2022 44th Annual International Conference of the IEEE Engineering in Medicine &Amp; Biology Society (EMBC) 2022
DOI: 10.1109/embc48229.2022.9871489
|View full text |Cite
|
Sign up to set email alerts
|

ViT-HGR: Vision Transformer-based Hand Gesture Recognition from High Density Surface EMG Signals

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
14
0

Year Published

2022
2022
2025
2025

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 28 publications
(14 citation statements)
references
References 20 publications
0
14
0
Order By: Relevance
“…This problem can only be solved by allowing the user to directly control the closing speed or by allowing so many speeds to be selected that the differences are effectively negligible. We consider the latter to be unfeasible, especially because the maximum of poses (as far as we are aware) that can be encoded with ≥ 90% accuracy is 65 [9] if we count isometric movements, or 52 [32] without isometric tasks.…”
Section: Discussionmentioning
confidence: 99%
“…This problem can only be solved by allowing the user to directly control the closing speed or by allowing so many speeds to be selected that the differences are effectively negligible. We consider the latter to be unfeasible, especially because the maximum of poses (as far as we are aware) that can be encoded with ≥ 90% accuracy is 65 [9] if we count isometric movements, or 52 [32] without isometric tasks.…”
Section: Discussionmentioning
confidence: 99%
“…The original ViT models substitute the inherent inductive bias favoring local processing in convolutions with global processing facilitated by multi-head self-attention, resulting in enhanced performance in vision tasks. Recently, Montazerin introduced the ViT-HGR framework, leveraging the ViT architecture to efficiently classify 65 hand gestures using HD-sEMG data with a significant accuracy of 84.6% [51], demonstrating its effectiveness in processing sequential signals like sEMG [51]. However, it is important to note that the patchification using linear projection in the original ViT may lead to the loss of neighborhood dependencies within patches, particularly in closely related sequential signals.…”
Section: Patch Embedding With 1d-cnnmentioning
confidence: 99%
“… Burrello et al (2022) proposed using Bioformers, a supersmall attention-based neural network architecture, to solve the problems of big resource consumption and difficult improvement of accuracy in sEMG signal gesture recognition tasks. Montazerin et al (2022) proposed a gesture recognition model called ViT-HGR, which is the first to introduce visual transformer architecture into high-density sEMG signal gesture recognition tasks, leveraging its parallel computing advantages to overcome the long training time and memory limitations of deep learning models. Liu et al (2023) proposed a CNN-Transformer hybrid network for high-accuracy dynamic gesture recognition.…”
Section: Introductionmentioning
confidence: 99%