IberSPEECH 2022 2022
DOI: 10.21437/iberspeech.2022-9
|View full text |Cite
|
Sign up to set email alerts
|

Speaker-Adapted End-to-End Visual Speech Recognition for Continuous Spanish

Abstract: Different studies have shown the importance of visual cues throughout the speech perception process. In fact, the development of audiovisual approaches has led to advances in the field of speech technologies. However, although noticeable results have recently been achieved, visual speech recognition remains an open research problem. It is a task in which, by dispensing with the auditory sense, challenges such as visual ambiguities and the complexity of modeling silence must be faced. Nonetheless, some of these… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 38 publications
0
1
0
Order By: Relevance
“…Currently, our best results with this database have been obtained with the pretrained CTC/Attention architecture proposed by Ma et al [22], with a WER of around 40% in the speaker-dependent partition. Considering methods based on the fine-tuning technique, we then studied the development of speakeradapted VSR systems [54], a work that is the basis of this conference paper extension.…”
Section: Related Workmentioning
confidence: 99%
“…Currently, our best results with this database have been obtained with the pretrained CTC/Attention architecture proposed by Ma et al [22], with a WER of around 40% in the speaker-dependent partition. Considering methods based on the fine-tuning technique, we then studied the development of speakeradapted VSR systems [54], a work that is the basis of this conference paper extension.…”
Section: Related Workmentioning
confidence: 99%