Phonetic alignment is the task of finding the limits of phones and higher units in an audio file. This has been reliably done in many languages such as English, French and German, but, so far, no available Brazilian Portuguese aligner had a performance comparable with the ones used for these other languages. Thus, the main goal of this work was to implement a useful tool for forced alignment for Brazilian Portuguese. The implementation was done in two steps, the grapheme-to-phoneme conversion and the alignment itself. The Converter is responsible for receiving the input transcription in graphemes and converting it to its equivalent in phonemes and allophones, and was implemented using computational rules derived from the analysis of regular grapheme-phoneme relations in Brazilian Portuguese and an exception dictionary, for words to which no regular rules could be applied. The Aligner was responsible for aligning the phonemes/allophones of the previous module to the corresponding acoustic intervals of the audio file, called "phones". This module was implemented using hidden Markov models. Results for the Converter have an accuracy of over 99%, where the main mistakes involved mid vowels /e/ and /ɛ/ and /o/ and /ɔ/. As for the Aligner, the best model has 87% of the alignments with errors below 25 ms.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.