Interspeech 2014
DOI: 10.21437/interspeech.2014-298

Opti-speech: a real-time, 3D visual feedback system for speech training

Cited by 18 publications (7 citation statements)
References: 17 publications
“…Articulatory movement prediction from text input can be useful for audiovisual speech synthesis. A specific application is computer-assisted pronunciation training / computer-aided language learning [26,27,28], which can be beneficial for learners of second languages. With such a combined TTS and text-to-articulatory prediction system, by giving an arbitrary input text, one is able to hear the speech and, in synchrony with it, see how to move the tongue in 2D or 3D to produce target speech sounds.…”
Section: Discussion
Citation type: mentioning
confidence: 99%
“…As pointed out in Section 1, the results in AAI might be useful for speech recognition [2], synthesis [3], talking heads [4], and for pronunciation training and language tutoring [5].…”
Section: Discussion
Citation type: mentioning
confidence: 99%
“…Recently, there has been a significant interest in AAI, because learning the correlation between articulation and acoustics could improve the performance of several tasks such as speech recognition [2], synthesis [3] and talking heads [4]. It can help the visualization of speech production as 3D articulatory animations for pronunciation training and language tutoring [5].…”
Section: Introduction
Citation type: mentioning
confidence: 99%
“…Such geometrical models have been successfully used in previous work to generate animations from provided articulatory data: Katz et al [25] presented a real-time visual feedback system that deforms a generic tongue model using EMA data. However, due to the generic nature of the model, their approach did not take anatomical differences into account.…”
Section: A Background
Citation type: mentioning
confidence: 99%