2021
DOI: 10.1007/978-3-030-78095-1_21
|View full text |Cite
|
Sign up to set email alerts
|

Multimodal Fusion and Sequence Learning for Cued Speech Recognition from Videos

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
3
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 25 publications
0
3
0
Order By: Relevance
“…The technique is the best suited for lipreading applications and attained more than 80% in CS recognition [25]. Deep learning was also utilized to analyze hand shape and lip features using CNN models for CS recognition from videos [26]. A Malay CSR system was introduced to help the deaf learn and practice the basics of cued speech consonants and vowels using hand gestures [27].…”
Section: Multimodal Integration For Hard Of Hearingmentioning
confidence: 99%
“…The technique is the best suited for lipreading applications and attained more than 80% in CS recognition [25]. Deep learning was also utilized to analyze hand shape and lip features using CNN models for CS recognition from videos [26]. A Malay CSR system was introduced to help the deaf learn and practice the basics of cued speech consonants and vowels using hand gestures [27].…”
Section: Multimodal Integration For Hard Of Hearingmentioning
confidence: 99%
“…Text2sign [21] HLSTM [22] ESN [23] DiffGAN [24] RG [25] SEEG [26] HA2G [27] Paul Duchnowski [28] Gérard Bailly [29] Wav2lip [30] Audio2head [31] AD-NERF [32] DiffTalk [33] Rule-based [34] LBP [35] SDF [36] PTSLP [37] Syn [38] MMFSL [39] Re-Syn [40] CMML [41] DTW [42] HMMs [43] FCN [44] RL [45] CS [1] MCCS [46] SLreview [2] UnorgSign [47] CoSreview [6] CoSreview [3] SE [48] THreview [4] THE [49] VHTHG [50] Fig. 3: Structured taxonomy of the existing BL research which includes three genres.…”
Section: Sign Language Recognitionmentioning
confidence: 99%
“…More recently, traditional DL-based methods (i.e., CNNbased, LSTM-based) have been developed to alleviate the aforementioned problem. For instance, Sankar et al [39] propose a novel RNN model trained with a Connectionist Temporal Classification (CTC) loss [225]. Papatimitriou et al [170] propose a fully convolutional model with a time-depth separable block and attention-based decoder.…”
mentioning
confidence: 99%