2023
DOI: 10.1016/j.eswa.2022.119394

Automatic translation of sign language with multi-stream 3D CNN and generation of artificial depth maps

citations
Cited by 39 publications
(7 citation statements)
references
References 30 publications
“…ISLR shares a lot of features with action recognition, and consequently there are several works using CNNs for feature extraction and classification [32, 33, 34, 35]. Recent work has also relied on employing 3D-CNNs [36, 37] to capture spatiotemporal information in an ensemble way. In [38, 39, 40], an Inflated 3D ConvNet (I3D) architecture [22] is proposed, whose application produces significant improvements in ISLR performance.…”
Section: Related Work (mentioning)
confidence: 99%
“…Although the test accuracy of the proposed method is high, the number of classes is quite low compared to the number of words used in general sign language dictionaries. Castro et al [50] introduced a multi-stream approach involving processing summarized RGB frames, segmented regions of the hands and face, joint distances, and artificially generated depth data through a 3D-CNN. In this method, it was shown that the addition of artificial depth maps increased the generalization capacity for different signers.…”
Section: Related Literature (mentioning)
confidence: 99%
“…[31][32][33][34][35]. This is considered a more challenging task as there is no predefined dataset available for regional languages and all the time authors must collect their own dataset for very few postures [36,37]. The good thing about sensor-based prototypes is that they are each worn and carried in public.…”
Section: Literature Review (mentioning)
confidence: 99%