Interspeech 2021 2021
DOI: 10.21437/interspeech.2021-2124
|View full text |Cite
|
Sign up to set email alerts
|

KazakhTTS: An Open-Source Kazakh Text-to-Speech Synthesis Dataset

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3
2

Relationship

3
6

Authors

Journals

citations
Cited by 16 publications
(6 citation statements)
references
References 17 publications
0
6
0
Order By: Relevance
“…In order to convert the text response from ChatGPT to speech, we used the gTTS (Google Text-to-Speech) library [16] for the English language and the Kazakh TTS model [17] for the Kazakh language. The gTTS is an interface to Google Translate's Text-to-Speech API.…”
Section: Text-to-speech (Tts)mentioning
confidence: 99%
“…In order to convert the text response from ChatGPT to speech, we used the gTTS (Google Text-to-Speech) library [16] for the English language and the Kazakh TTS model [17] for the Kazakh language. The gTTS is an interface to Google Translate's Text-to-Speech API.…”
Section: Text-to-speech (Tts)mentioning
confidence: 99%
“…Next, the generated caption text was converted into audio, using a text-to-speech model. In our study, we utilized the KazakhTTS model [25] to convert Kazakh text to speech. Finally, the generated audio was played through the user's headphones.…”
Section: Model Deploymentmentioning
confidence: 99%
“…To address the aforementioned problem, many datasets have been developed in less popular languages. For example, to advance speech processing research in Kazakhstan, researchers developed open-source Kazakh speech corpora for building speech recognition [17] and speech synthesis [24] applications. To enable speech research and increase accessibility of speech-enabled applications for illiterate users, Doumbouya et al [9] released 150 hours of transcribed audio data for West African languages.…”
Section: Related Workmentioning
confidence: 99%