6th Workshop on Spoken Language Technologies for Under-Resourced Languages (SLTU 2018) 2018
DOI: 10.21437/sltu.2018-10
|View full text |Cite
|
Sign up to set email alerts
|

Designing an IVR Based Framework for Telephony Speech Data Collection and Transcription in Under-Resourced Languages

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 0 publications
0
2
0
Order By: Relevance
“…This corpus contains speech data of some rarely studied Indian languages, such as Angami, Bodo, Khasi, Hrangkhawl, and Sumi. Some of the other sources for availing standard Indian speech data are Speehocean, 11 Indic-TTS, 12 13 There are also developments in open-source corpora, such as Mozilla Common Voice, 14 OpenSLR, 15 with speech data for the Indian languages. In Table 3, we have summarized the key information about the major speech corpora developed for Indian spoken language recognition research.…”
Section: Other Developmentsmentioning
confidence: 99%
See 1 more Smart Citation
“…This corpus contains speech data of some rarely studied Indian languages, such as Angami, Bodo, Khasi, Hrangkhawl, and Sumi. Some of the other sources for availing standard Indian speech data are Speehocean, 11 Indic-TTS, 12 13 There are also developments in open-source corpora, such as Mozilla Common Voice, 14 OpenSLR, 15 with speech data for the Indian languages. In Table 3, we have summarized the key information about the major speech corpora developed for Indian spoken language recognition research.…”
Section: Other Developmentsmentioning
confidence: 99%
“…Open source corpora can be developed for the Indian languages by crowd-sourcing or collecting data from the web. For each language, data should be collected from speakers from diferent regions, genders, age groups, and sections of the society [14]. Variations in terms of background noise, recording channels, and room environments should be maintained [70].…”
Section: Issue Of Low-resourcementioning
confidence: 99%