Speech Modelling Based on Acoustic-to-Articulatory Mapping

Schoentgen, Jean

doi:10.1007/11520153_6

Cited by 1 publication

(3 citation statements)

References 39 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…It consists in identifying, for a chosen vocal tract model constituted by a set of overlapping tubes, a set of parameters so that the resonances of the model correspond to the formants observed in a given produced speech sound. As is has been observed in [69], distinct vocal tract shapes can produce the same set of formant frequencies and therefore, a given set of formant values cannot univocally identify the vocal tract shape that has generated them (inverse problem). There are infinite solutions for the inverse problem.…”

Section: Nonlinearities In Speech Productionmentioning

confidence: 96%

“…This is done taking three aspects into account: a) the accuracy of the vocal tract shapes estimate via formant-toarea mapping in comparison to the real vocal shapes; b) the underlying vocal tract models used; c) the numerical stability. Results shown that a good approximation is guaranteed only for speech sounds that are produced with a simple vocal tract configuration "single cavity, single constriction, convex tongue, as well as constrained in the laryngo-pharynx, and, possibly, at the lips" [69]; models based on a small number of conical tubelets with continuously varying cross area sections are preferred to exponential tubelets and cylindrical tubelets; the convergence to the desired format frequency values could be obtained with a precision greater than 1 Hz, even though the estimated vocal tract shapes could quantitatively and qualitatively differ from those built via the observed formant frequencies.…”

Section: Nonlinearities In Speech Productionmentioning

confidence: 98%

“…Even when functional constraints are imposed (such as minimal deformation, or minimal deformation rate of the vocal tract about a reference shape, or minimal deformation jerk) the inverse mapping does not fix the model, i.e., the real vocal tract shape that has produced that sound. To highlight the problems involved in the implementation of the acoustic-to-articulatory mapping, Schoentgen [69] reports a series of experiments where formant frequencies measured from sustained American English sounds are used to identify the corresponding vocal tract shapes. This is done taking three aspects into account: a) the accuracy of the vocal tract shapes estimate via formant-toarea mapping in comparison to the real vocal shapes; b) the underlying vocal tract models used; c) the numerical stability.…”

Section: Nonlinearities In Speech Productionmentioning

confidence: 99%

See 2 more Smart Citations

Some Notes on Nonlinearities of Speech

Esposito

Marinaro

2005

Nonlinear Speech Modeling and Applications

View full text Add to dashboard Cite

Abstract. Speech is exceedingly nonlinear. Efforts to propose non-linear models of its dynamics are worth to be made but difficult to implement since nonlinearity is not easily handled from an engineering and mathematical point of view. This paper is an attempt to make accessible to untrained people the notion of nonlinearity in speech, revising several nonlinear speech phenomena and the engineering endeavour for modeling them.

show abstract

Section: Nonlinearities In Speech Productionmentioning

confidence: 96%

Section: Nonlinearities In Speech Productionmentioning

confidence: 98%

Section: Nonlinearities In Speech Productionmentioning

confidence: 99%

See 1 more Smart Citation