2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019
DOI: 10.1109/iccv.2019.00550
|View full text |Cite
|
Sign up to set email alerts
|

Fingerspelling Recognition in the Wild With Iterative Visual Attention

Abstract: Sign language recognition is a challenging gesture sequence recognition problem, characterized by quick and highly coarticulated motion. In this paper we focus on recognition of fingerspelling sequences in American Sign Language (ASL) videos collected in the wild, mainly from YouTube and Deaf social media. Most previous work on sign language recognition has focused on controlled settings where the data is recorded in a studio environment and the number of signers is limited. Our work aims to address the challe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
63
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 47 publications
(64 citation statements)
references
References 33 publications
1
63
0
Order By: Relevance
“…Due to the basic status of convolutional neural networks (CNN) in deep learning networks, some research teams have conducted a series of CNN-based isolated sign language recognition studies since 2013 [ 6 , 17 , 18 , 19 , 20 , 21 , 22 , 23 , 24 , 25 ]. Based on CNN recognition, the algorithm can be optimized by adding multi-modal data (including depth, skeleton, key points of the human body, etc.…”
Section: Related Workmentioning
confidence: 99%
“…Due to the basic status of convolutional neural networks (CNN) in deep learning networks, some research teams have conducted a series of CNN-based isolated sign language recognition studies since 2013 [ 6 , 17 , 18 , 19 , 20 , 21 , 22 , 23 , 24 , 25 ]. Based on CNN recognition, the algorithm can be optimized by adding multi-modal data (including depth, skeleton, key points of the human body, etc.…”
Section: Related Workmentioning
confidence: 99%
“…Pre-Trained Models. The pre-trained models considered for this paper are all available for use with Keras 7 , which is the DL API that is used for this paper. All models are specialized in image classification, trained on the dataset ImageNet 8 .…”
Section: Transfer Learningmentioning
confidence: 99%
“…"This article is part of the topical collection "Recent Trends in Computer Vision" guest edited by P. Nagabhushan, Balasubramaniyan Raman, Satish Kumar Singh and Subrahmanyam Murala" Since the differences between signs in sign languages largely consist of different patterns like hand movements and shiftings, using artificial neural networks that learn from datasets of sign images is useful to develop a model able to detect and interpret signs [7].…”
Section: Introductionmentioning
confidence: 99%
“…These models learn the relevant spatial or temporal parts of the image or video automatically from data. These models have also been used in the SLR domain [2], [8], [34], [36], [40].…”
Section: Related Workmentioning
confidence: 99%
“…In [40], ASL fingerspelling recognition model is proposed with iterative visual attention mechanism for real-life data. Fingerspelling is a part of sign language in which words are signed letter by letter.…”
Section: A Sign Language Datasetsmentioning
confidence: 99%