Background: The automatic syllabification process is an essential prerequisite for speech synthesis systems. However, the task is not trivial, and several techniques have been adopted over the last decade. Furthermore, while there are many public resources for some languages (e.g., English and Japanese), the resources for Brazilian Portuguese (BP) are still limited. This paper discusses ways to diminish this drawback, through the implementation of an open-source syllabification system for BP. Methods: The proposed tool is based on published rule-based algorithms, with some new proposals, especially in the treatment of words with diphthongs and hiatus. Results: Computer experiments were performed on a randomly chosen extract of the CETEN-Folha text corpus, and the results showed the percentage of correctly syllabified words of 99%. Conclusions: A subjective evaluation was also conducted in order to compare the elaborated syllabification algorithm with the reference one within a text-to-speech system for BP. All developed codes and databases are publicly available.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.