Exploring Silent Speech Interfaces Based on Frequency-Modulated Continuous-Wave Radar

Ferreira, David; Silva, Samuel; Curado, Francisco; Teixeira, António

doi:10.3390/s22020649

Cited by 15 publications

(11 citation statements)

References 45 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The result of 95% approves that the mouth motion produces informative signals for UWB sensing 7 . FMCW radar is also an optional choice which has been proven in the result of the paper 8 . The mentioned work adopts point clouds of human mouth while speaking as data feature for classification work of 13 words with 4 speakers.…”

Section: Methodsmentioning

confidence: 97%

A comprehensive multimodal dataset for contactless lip reading and acoustic analysis

Ge,

Tang,

et al. 2023

Sci Data

View full text Add to dashboard Cite

Small-scale motion detection using non-invasive remote sensing techniques has recently garnered significant interest in the field of speech recognition. Our dataset paper aims to facilitate the enhancement and restoration of speech information from diverse data sources for speakers. In this paper, we introduce a novel multimodal dataset based on Radio Frequency, visual, text, audio, laser and lip landmark information, also called RVTALL. Specifically, the dataset consists of 7.5 GHz Channel Impulse Response (CIR) data from ultra-wideband (UWB) radars, 77 GHz frequency modulated continuous wave (FMCW) data from millimeter wave (mmWave) radar, visual and audio information, lip landmarks and laser data, offering a unique multimodal approach to speech recognition research. Meanwhile, a depth camera is adopted to record the landmarks of the subject’s lip and voice. Approximately 400 minutes of annotated speech profiles are provided, which are collected from 20 participants speaking 5 vowels, 15 words, and 16 sentences. The dataset has been validated and has potential for the investigation of lip reading and multimodal speech recognition.

show abstract

Section: Methodsmentioning

confidence: 97%

A comprehensive multimodal dataset for contactless lip reading and acoustic analysis

Ge,

Tang,

et al. 2023

Sci Data

View full text Add to dashboard Cite

show abstract

“…FMCW radar utilizes continuous wave signals, whereas IR-UWB radar employs pulse signals for its functionality. Thus, the signal processing algorithm used in the FMCW radar-based SSR studies of [29], [30], and [31] cannot be directly applied to our IR-UWB radar-based SSR study.…”

Section: Principles Of Ir-uwb Radar-based Ssrmentioning

confidence: 99%

“…As indicated in Table 2, with the same vowel corpus as that used in our study, the method proposed in [28] achieved an accuracy of only 51.59%, whereas our method (FERASEC + DNN-HMM) achieved a significantly higher accuracy of 86.47%. Ferreira et al [30] reported an accuracy of 88.3% for a 13-word European Portuguese classification task with four participants using an FMCW radar.…”

Section: Performance Comparison With Other Contactless Radar-based Ss...mentioning

confidence: 99%

“…Surface electromyography [16], [17], vision [18], [19], [20], [21], [22], [23], ultrasound imaging [24], [25], and radar [26], [27], [28], [29], [30], [31] are techniques for capturing nonacoustic speech-related biosignals without the need to place sensors inside the oral cavity. Although these techniques are more convenient than the aforementioned ones, they have some shortcomings.…”

Section: Introductionmentioning

confidence: 99%

“…Ferreira et al [30] performed a speech recognition task with 13 isolated European Portuguese words using an FMCW radar. They utilized velocity dispersion data as speech features and successfully demonstrated that these features were capable of classifying distinguishable words.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

IR-UWB Radar-Based Contactless Silent Speech Recognition of Vowels, Consonants, Words, and Phrases

Lee,

Shin,

Kim

et al. 2023

IEEE Access

View full text Add to dashboard Cite

show abstract

Human-inspired computational models for European Portuguese: a review

Teixeira

Silva

2023

Lang Resources & Evaluation

View full text Add to dashboard Cite

This paper surveys human-inspired speech technologies developed for European Portuguese and the computational models they integrate and made them possible. In this regard, it covers systems for synthesis and recognition as well as information on the methods adopted for the speech production studies that were performed, in parallel, to support them. And, on doing so, it can also contribute to provide an entry point for those who work in the field but are not familiar with these particular areas, including: context, history, and comprehensive references. As the great majority of work in these areas for European Portuguese was done by the first author’s research group, this paper can also be seen as a review of more than 25 years of research at University of Aveiro in these topics.

show abstract

Exploring Silent Speech Interfaces Based on Frequency-Modulated Continuous-Wave Radar

Cited by 15 publications

References 45 publications

A comprehensive multimodal dataset for contactless lip reading and acoustic analysis

A comprehensive multimodal dataset for contactless lip reading and acoustic analysis

IR-UWB Radar-Based Contactless Silent Speech Recognition of Vowels, Consonants, Words, and Phrases

Human-inspired computational models for European Portuguese: a review

Contact Info

Product

Resources

About