“…This is done taking three aspects into account: a) the accuracy of the vocal tract shapes estimated via formant-to-area mapping in comparison to the real vocal tract shapes; b) the underlying vocal tract models used; c) the numerical stability. Results show that a good approximation is guaranteed only for speech sounds that are produced with a simple vocal tract configuration "single cavity, single constriction, convex tongue, as well as constrained in the laryngo-pharynx, and, possibly, at the lips" [69]; models based on a small number of conical tubelets with continuously varying cross-sectional areas are preferred to exponential tubelets and cylindrical tubelets; the convergence to the desired formant frequency values could be obtained with a precision better than 1 Hz, even though the estimated vocal tract shapes could quantitatively and qualitatively differ from those built via the observed formant frequencies.…”
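
To make the relation between a tube model and its formant frequencies concrete, the following is a minimal sketch of the forward direction only (area function to formants), which a formant-to-area mapping would have to invert. It is not the method evaluated in the quoted work. It assumes the simplest of the three model families named above, lossless cylindrical tubelets of equal length, with a closed glottis and an ideal open lip termination, and a sound speed of 350 m/s. The function names `chain_d` and `formants` and all parameter values are hypothetical, chosen for illustration.

```python
import math


def chain_d(freq, areas, seg_len, c=350.0):
    """D element of the lossless chain matrix from glottis to lips.

    Each cylindrical tubelet has the matrix [[cos, j*z*sin], [j*sin/z, cos]]
    with z proportional to 1/area. Only the real coefficients are tracked,
    because products of such matrices keep the same real/imaginary pattern.
    Assumption: all tubelets share the same length seg_len (in metres).
    """
    k = 2.0 * math.pi * freq / c              # wavenumber
    cos_kl = math.cos(k * seg_len)
    sin_kl = math.sin(k * seg_len)
    a, b, cc, d = 1.0, 0.0, 0.0, 1.0          # identity matrix
    for area in areas:                         # ordered glottis -> lips
        z = 1.0 / area                         # rho*c cancels in the zeros of D
        a, b, cc, d = (a * cos_kl - b * sin_kl / z,
                       a * z * sin_kl + b * cos_kl,
                       cc * cos_kl + d * sin_kl / z,
                       d * cos_kl - cc * z * sin_kl)
    return d


def formants(areas, seg_len, f_max=4000.0, step=5.0, c=350.0):
    """Resonances below f_max, found as zeros of D(f) by scan plus bisection."""
    roots = []
    f_lo, d_lo = step, chain_d(step, areas, seg_len, c)
    while f_lo < f_max:
        f_hi = f_lo + step
        d_hi = chain_d(f_hi, areas, seg_len, c)
        if d_hi == 0.0:
            roots.append(f_hi)                 # grid point is an exact zero
        elif d_lo * d_hi < 0.0:
            lo, hi, g_lo = f_lo, f_hi, d_lo
            while hi - lo > 1e-6:              # refine far below 1 Hz
                mid = 0.5 * (lo + hi)
                g_mid = chain_d(mid, areas, seg_len, c)
                if g_lo * g_mid <= 0.0:
                    hi = mid
                else:
                    lo, g_lo = mid, g_mid
            roots.append(0.5 * (lo + hi))
        f_lo, d_lo = f_hi, d_hi
    return roots


# Uniform 17.5 cm tube split into 8 tubelets: the resonances are the odd
# multiples of c / (4 * L), i.e. approximately 500, 1500, 2500, 3500 Hz.
uniform = formants([4.0] * 8, 0.175 / 8)

# A hypothetical area function with a constriction in the middle.
constricted = formants([4.0, 4.0, 3.0, 1.0, 0.5, 1.0, 3.0, 4.0], 0.175 / 8)
```

The uniform tube serves as a sanity check because its resonances are known in closed form. The sketch also hints at why the inverse problem is delicate: matching a handful of formant frequencies to sub-hertz precision constrains only a few numbers, so different area functions can in principle reproduce them, which is consistent with the quoted observation that the estimated shapes may still differ from the real ones.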