Pétur Helgason scite author profile

Vocal sound imitations provide a new challenge for understanding the coupling between articulatory mechanisms and the resulting audio. In this study, the classification of three articulatory categories, phonation, supraglottal myoelastic vibrations, and turbulence, have been modeled from audio recordings. Two data sets were assembled, consisting of different vocal imitations by four professional imitators and four non-professional speakers in two different experiments. The audio data were manually annotated by two experienced phoneticians using a detailed articulatory description scheme. A separate set of audio features was developed specifically for each category using both time-domain and spectral methods. For all time-frequency transformations, and for some secondary processing, the recently developed Auditory Receptive Fields Toolbox was used. Three different machine learning methods were applied for predicting the final articulatory categories. The result with the best generalization was found using an ensemble of multilayer perceptrons. The cross-validated classification accuracy was 96.8% for phonation, 90.8% for supraglottal myoelastic vibrations, and 89.0% for turbulence using all the 84 developed features. A final feature reduction to 22 features yielded similar results. V

show abstract

Swedish quantity: Central Standard Swedish and Fenno-Swedish

Helgason¹,

Ringen²,

Suomi³

2013

Journal of Phonetics

View full text Add to dashboard Cite

The interaction of phonetics, phonology and morphology in an icelandic text-to-speech system

Granström¹,

Helgason²,

Þráinsson³

1992

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Pétur Helgason

Rate effects on Swedish VOT: Evidence for phonological overspecification

Voicing and aspiration in Swedish stops

Prediction of three articulatory categories in vocal sound imitations using models for auditory receptive fields

Swedish quantity: Central Standard Swedish and Fenno-Swedish

The interaction of phonetics, phonology and morphology in an icelandic text-to-speech system

Contact Info

Product

Resources

About