Interspeech 2020 2020
DOI: 10.21437/interspeech.2020-2926
|View full text |Cite
|
Sign up to set email alerts
|

Toward Silent Paralinguistics: Speech-to-EMG — Retrieving Articulatory Muscle Activity from Speech

Abstract: Electromyographic (EMG) signals recorded during speech production encode information on articulatory muscle activity and also on the facial expression of emotion, thus representing a speech-related biosignal with strong potential for paralinguistic applications. In this work, we estimate the electrical activity of the muscles responsible for speech articulation directly from the speech signal. To this end, we first perform a neural conversion of speech features into electromyographic time domain features, and … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
1
0
1

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
1

Relationship

2
3

Authors

Journals

citations
Cited by 6 publications
(2 citation statements)
references
References 29 publications
0
1
0
1
Order By: Relevance
“…We further calculate the concordance correlation coefficient (CCC) for each of the MFCCs, as estimated from the EMG data versus from the reference audio (Figure 2), suggesting that lower coefficients (i. e., large spectral changes) were predicted relatively well, whereas the prediction quality for the higher coefficients was degraded. As demonstrated by our complementary work on retrieving articulatory muscle activity from EMG [19], the CCC for this inverse SCP problem is around 0.62. I. e., results for Speech-to-EMG CCCs have been comparable to the prediction of the first two MFCCs in figure 2.…”
Section: Emg-to-speech Conversionmentioning
confidence: 85%
“…We further calculate the concordance correlation coefficient (CCC) for each of the MFCCs, as estimated from the EMG data versus from the reference audio (Figure 2), suggesting that lower coefficients (i. e., large spectral changes) were predicted relatively well, whereas the prediction quality for the higher coefficients was degraded. As demonstrated by our complementary work on retrieving articulatory muscle activity from EMG [19], the CCC for this inverse SCP problem is around 0.62. I. e., results for Speech-to-EMG CCCs have been comparable to the prediction of the first two MFCCs in figure 2.…”
Section: Emg-to-speech Conversionmentioning
confidence: 85%
“…Hier könnten insbesondere biosignalbasierte adaptive Systeme [26] einen Beitrag leisten, indem sie z. B. hel-fen, die verbleibenden sprachlichen und kommunikativen Fähigkeiten einer er-kranktenPersonzu unterstützen [27][28][29][30]. Ebenso könnten diese Systeme lernen zu erkennen, durch welche Angebote und Anregungen die an Demenz erkrankte Person wieder zu mehr aktiver Teilnahme (englisch: "Engagement") am Alltagsgeschehen angeregt werden könnte [31][32][33][34].…”
Section: Vom Worst Case Zum Best Case Der Ki-nutzung Im Gesundheitsbe...unclassified