2018 IEEE Spoken Language Technology Workshop (SLT) 2018
DOI: 10.1109/slt.2018.8639639
|View full text |Cite
|
Sign up to set email alerts
|

American Sign Language Fingerspelling Recognition in the Wild

Abstract: We address the problem of American Sign Language fingerspelling recognition "in the wild", using videos collected from websites. We introduce the largest data set available so far for the problem of fingerspelling recognition, and the first using naturally occurring video data. Using this data set, we present the first attempt to recognize fingerspelling sequences in this challenging setting. Unlike prior work, our video data is extremely challenging due to low frame rates and visual variability. To tackle the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
70
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
1
1

Relationship

1
5

Authors

Journals

citations
Cited by 61 publications
(81 citation statements)
references
References 33 publications
0
70
0
Order By: Relevance
“…We make two main contributions: (1) We propose iterative attention, an approach for obtaining regions of interest of high resolution with limited computation (see Figure 3). Our model trained with iterative attention achieves higher accuracy than the previous best approach [35], which requires a custom hand detector. We further show that even when a hand or face detector is available, our approach provides significant added value.…”
Section: Introductionmentioning
confidence: 82%
See 4 more Smart Citations
“…We make two main contributions: (1) We propose iterative attention, an approach for obtaining regions of interest of high resolution with limited computation (see Figure 3). Our model trained with iterative attention achieves higher accuracy than the previous best approach [35], which requires a custom hand detector. We further show that even when a hand or face detector is available, our approach provides significant added value.…”
Section: Introductionmentioning
confidence: 82%
“…In this paper we consider a constrained task (fingerspelling recognition), but with looser visual and stylistic constraints than in most previous work. The recently introduced Chicago Fingerspelling in the Wild (ChicagoFSWild) data set [35] consists of 7304 fingerspelling sequences from online videos. This data set includes a large number of signers (168) and a wide variety of challenging visual conditions, and we use it as one of our test beds.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations