ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020
DOI: 10.1109/icassp40776.2020.9053231
|View full text |Cite
|
Sign up to set email alerts
|

Prediction of Voicing and the F0 Contour from Electromagnetic Articulography Data for Articulation-to-Speech Synthesis

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 12 publications
0
2
0
Order By: Relevance
“…While deep EMA-to-speech models have been previously studied, as far as we are aware [35,36,37], current models are not highly intelligible, achieving a transcription WER of around 30% on open-vocabulary tasks [35]. In this work, we build an EMA-to-speech model that achieves a transcription WER of 18.5% and perform detailed error analyses on the synthesized utterances.…”
Section: Articulatory Synthesismentioning
confidence: 99%
“…While deep EMA-to-speech models have been previously studied, as far as we are aware [35,36,37], current models are not highly intelligible, achieving a transcription WER of around 30% on open-vocabulary tasks [35]. In this work, we build an EMA-to-speech model that achieves a transcription WER of 18.5% and perform detailed error analyses on the synthesized utterances.…”
Section: Articulatory Synthesismentioning
confidence: 99%
“…The results showed that an affine transformation can satisfactorily approximate the relation between the two speaking modes. More recently, in [245], pitch prediction (i.e., prediction of the speech voicing and fundamental frequency) from EMA data captured by six coils placed on the upper lip, the lower lip, the lower incisor, the tongue tip, the tongue body, and the tongue dorsum was investigated, achieving surprisingly good results despite EMA not capturing any information about the vibrations of the vocal folds.…”
Section: ) Magnetic Articulographymentioning
confidence: 99%