Interspeech 2018 2018
DOI: 10.21437/interspeech.2018-2080
|View full text |Cite
|
Sign up to set email alerts
|

Investigating Objective Intelligibility in Real-Time EMG-to-Speech Conversion

Abstract: This paper presents an analysis of the influence of various system parameters on the output quality of our neural network based real-time EMG-to-Speech conversion system. This EMG-to-Speech system allows for the direct conversion of facial surface electromyographic signals into audible speech in real time, allowing for a closed-loop setup where users get direct audio feedback. Such a setup opens new avenues for research and applications through co-adaptation approaches. In this paper, we evaluate the influence… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 7 publications
(7 citation statements)
references
References 20 publications
0
7
0
Order By: Relevance
“…These low values can only be achieved through direct speech synthesis. In this sense, real-time SSI systems have been developed for sEMG [181], [182], PMA [183] and EMA [27]. There is also the possibility that real-time auditory feedback might enable the brain to assimilate the SSI as if it were the person's own voice, thus enabling the user to adapt her/his own speaking patterns to produce better acoustics.…”
Section: Comparison Of the Two Ssi Approachesmentioning
confidence: 99%
See 3 more Smart Citations
“…These low values can only be achieved through direct speech synthesis. In this sense, real-time SSI systems have been developed for sEMG [181], [182], PMA [183] and EMA [27]. There is also the possibility that real-time auditory feedback might enable the brain to assimilate the SSI as if it were the person's own voice, thus enabling the user to adapt her/his own speaking patterns to produce better acoustics.…”
Section: Comparison Of the Two Ssi Approachesmentioning
confidence: 99%
“…Direct speech synthesis from EMG signals has also progressed considerably in recent years (see [31], [99], [181], [182]), following advances in array sEMG sensors and deep learning. As mentioned above, a particular advantage of EMG with respect to other techniques for articulator motion capture is that EMG signals can be sensed~60 ms before the actual movements of the articulators.…”
Section: ) Ssis Based On Emg Signalsmentioning
confidence: 99%
See 2 more Smart Citations
“…extremely noisy environments and/or military situations). For this automatic conversion task, typically electromagnetic articulography (EMA, [3,19,20]), ultrasound tongue imaging (UTI, [4,14,18,28]), permanent magnetic articulography (PMA, [10]), surface Electromyography (sEMG, [6,16,22]), lip video [1,7] and multimodal approaches are used [5]. Current SSI systems mostly apply the "direct synthesis" principle, where speech is generated without an intermediate step, directly from the articulatory data.…”
Section: Introductionmentioning
confidence: 99%