Cross-modal Neural Sign Language Translation

Duarte, Amanda

doi:10.1145/3343031.3352587

Cited by 28 publications

(12 citation statements)

References 31 publications


“…Distinct to SLR, the task of SLT was recently introduced by Camgoz et al [7], aiming to directly translate sign videos to spoken language sentences [15,31,45,62]. SLT is more challenging than CSLR due to the differences in grammar and ordering between sign and spoken language.…”

Section: Related Work (mentioning)

confidence: 99%


Progressive Transformers for End-to-End Sign Language Production

Saunders

Camgöz

Bowden

2020

Lecture Notes in Computer Science


The goal of automatic Sign Language Production (SLP) is to translate spoken language to a continuous stream of sign language video at a level comparable to a human translator. If this was achievable, then it would revolutionise Deaf hearing communications. Previous work on predominantly isolated SLP has shown the need for architectures that are better suited to the continuous domain of full sign sequences. In this paper, we propose Progressive Transformers, the first SLP model to translate from discrete spoken language sentences to continuous 3D sign pose sequences in an end-to-end manner. A novel counter decoding technique is introduced, that enables continuous sequence generation at training and inference. We present two model configurations, an end-to-end network that produces sign direct from text and a stacked network that utilises a gloss intermediary. We also provide several data augmentation processes to overcome the problem of drift and drastically improve the performance of SLP models. We propose a back translation evaluation mechanism for SLP, presenting benchmark quantitative results on the challenging RWTH-PHOENIX-Weather-2014T (PHOENIX14T) dataset and setting baselines for future research. Code available at https://github.com/BenSaunders27/ProgressiveTransformersSLP.

Section: Related Work (mentioning)

confidence: 99%

“…Recently, deep learning approaches have been applied to the task of SLP [15,53,61]. Stoll et al present an initial SLP model using a combination of Neural Machine Translation (NMT) and Generative Adversarial Networks (GANs) [54].…”

Section: Related Work (mentioning)

confidence: 99%

Progressive Transformers for End-to-End Sign Language Production

Saunders

Camgöz

Bowden

2020

Lecture Notes in Computer Science


“…For the model, we created a simple 2D human (see figure 1a) in Inkscape³. We used a 2D model instead of a 3D model because OpenPose can only estimate a 2D pose from a single camera perspective.…”

Section: Virtual Human Agent Model (mentioning)

confidence: 99%

“…An early example is TESSA [2], which translated from spoken language to British Sign Language in the post office domain. More recent approaches have adopted sequence-to-sequence approaches to translate a sequence of text into the corresponding signs [3,5]. However, these examples of automatic sign language generation all suffer from the use of technologies that are not highly available (motion capture, depth cameras, and large computational power respectively) leading to them being less general, portable, scalable, and ultimately usable.…”

Section: Introduction (mentioning)

confidence: 99%

Two Dimensional Sign Language Agent

McConnell

Foster

2020

Proceedings of the 20th ACM International Conference on Intelligent Virtual Agents


“…SLR is a field dedicated to the automated interpretation of hand gestures and other signs used in communications between people with a speech or hearing impairment. Because hardware and software components have evolved to the point where developing advanced systems with real-time translation capacities appear to be within reach, a large number of exciting and innovative solutions have been proposed and tested in recent years [5]-[9] with the objective of building fully functional systems that can understand sign language and respond to commands given in this format. However, before any truly practical applications can be considered, it is imperative to perfect the interpretation algorithms to the point where false positives are rare [6], [10]-[13].…”

Section: Introduction (mentioning)

confidence: 99%

Deep Learning for Sign Language Recognition: Current Techniques, Benchmarks, and Open Issues

Al‐Qurishi

Khalid

Souissi

2021

IEEE Access


People with hearing impairments are found worldwide; therefore, the development of effective local level sign language recognition (SLR) tools is essential. We conducted a comprehensive review of automated sign language recognition based on machine/deep learning methods and techniques published between 2014 and 2021 and concluded that the current methods require conceptual classification to interpret all available data correctly. Thus, we turned our attention to elements that are common to almost all sign language recognition methodologies. This paper discusses their relative strengths and weaknesses, and we propose a general framework for researchers. This study also indicates that input modalities bear great significance in this field; it appears that recognition based on a combination of data sources, including vision-based and sensor-based channels, is superior to a unimodal analysis. In addition, recent advances have allowed researchers to move from simple recognition of sign language characters and words towards the capacity to translate continuous sign language communication with minimal delay. Many of the presented models are relatively effective for a range of tasks, but none currently possess the necessary generalization potential for commercial deployment. However, the pace of research is encouraging, and further progress is expected if specific difficulties are resolved.
