Speech synthesizer is a technology which gives the computer a capability to speech text sequences. In this research, we develop a speech synthesizer for Indonesian Language based on the Hidden Markov Model (HMM). The Speech synthesizer using the HMM can produce more appropriate result than based on the syllable concatenation. There are some studies of speech synthesizers using HMM for Indonesian Language. However, it still has some problems such as it still cannot distinguish between vowel "e" ("e" in "get" is different from "e" in "apple"); It cannot handle abbreviation, numbers, special characters, and foreign (English) terms widely. In this research, we also proposed some methods to solve those problems. To solve "e" problem, this research divided the HMM for the 2 "e" vowel. To solve the other problems, the "e" rules, the abbreviation rules, the number rules, the special character rules, and the foreign term rules are made. To evaluate the synthesizer, we employ two methods: the Mean Opinion Score (MOS) to measure the naturalness of synthesized speech; and the Semantically Unpredictable Sentence (SUS) to measure the accuracy of the synthesized speech. Result shows that the developed speech synthesizer improved the naturalness of synthesized speech. It achieves 4.1 for MOS point and 96,07 % word accuracy.
Keywords-speech synthesizer; hidden markov model; MOS; SUS; bahasa IndonesiaI.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.