Interspeech 2009
DOI: 10.21437/interspeech.2009-727

ASR corpus design for resource-scarce languages

Cited by 35 publications
(14 citation statements)
References 6 publications
“…The value of η_d is empirically selected based on the performance of the system indicated by the Equal Error Rate (EER). The EER value indicates the operating point where the system's false acceptance rate is equal to its false rejection rate (Beigi, 2011). η_d of 30dB is found to present the best performance with the lowest EER.…”
Section: Data Augmentation For the Establishment Of The I-vector System (mentioning)
confidence: 97%
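The EER definition quoted above can be illustrated with a minimal sketch. It assumes two hypothetical lists of verification scores (genuine trials and impostor trials, higher meaning "more likely the same speaker") and sweeps the observed scores as candidate thresholds; the function name and the sample scores are illustrative and do not come from the citing paper.

```python
def equal_error_rate(genuine, impostor):
    """Approximate the EER by sweeping thresholds over the observed scores.

    A trial is accepted when its score is >= the threshold (an assumption
    of this sketch). Returns (eer, threshold) at the point where the false
    acceptance rate and false rejection rate are closest.
    """
    best = None
    for t in sorted(set(genuine) | set(impostor)):
        # False acceptance: impostor trials that pass the threshold.
        far = sum(s >= t for s in impostor) / len(impostor)
        # False rejection: genuine trials that fail the threshold.
        frr = sum(s < t for s in genuine) / len(genuine)
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            # Report the midpoint of FAR and FRR where they are closest.
            best = (gap, (far + frr) / 2, t)
    return best[1], best[2]


# Hypothetical scores, for illustration only.
genuine_scores = [0.9, 0.8, 0.7, 0.4]
impostor_scores = [0.6, 0.3, 0.2, 0.1]
eer, threshold = equal_error_rate(genuine_scores, impostor_scores)
```

With finite score lists the two error rates rarely cross exactly, so the sketch reports the midpoint at the closest threshold; selecting a parameter such as η_d would then amount to repeating this computation per candidate value and keeping the one with the lowest EER.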
“…This in turn is realised using autocorrelation (Broersen, 2006). LPCC are calculated using a recursive process (Beigi, 2011).…”
Section: Multitaper-fitted LPCC (mentioning)
confidence: 99%
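The recursive process mentioned in the statement above is not spelled out in the quote. One widely used textbook form of the LPC-to-cepstrum recursion is sketched below; it assumes the predictor convention s[n] ≈ Σ a_k s[n-k] (so the all-pole model is 1 / (1 − Σ a_k z^-k)) and omits the gain term c_0. The function name is hypothetical, and the citing paper may use a different sign convention or normalisation.

```python
def lpc_to_cepstrum(a, n_ceps):
    """Convert LPC predictor coefficients a[0..p-1] (a_1..a_p) into
    cepstral coefficients c_1..c_n_ceps using the standard recursion:

        c_n = a_n + sum_{k=1}^{n-1} (k/n) * c_k * a_{n-k}   for n <= p
        c_n =       sum_{k=n-p}^{n-1} (k/n) * c_k * a_{n-k} for n >  p
    """
    p = len(a)
    c = [0.0] * (n_ceps + 1)  # index 0 is unused (gain term omitted)
    for n in range(1, n_ceps + 1):
        acc = a[n - 1] if n <= p else 0.0
        # Only terms with 1 <= n - k <= p contribute.
        for k in range(max(1, n - p), n):
            acc += (k / n) * c[k] * a[n - k - 1]
        c[n] = acc
    return c[1:]


# Sanity check on a first-order model: c_n should equal a_1**n / n.
ceps = lpc_to_cepstrum([0.5], 3)
```

For a first-order predictor the log of the transfer function has the closed-form series a_1^n / n, which gives a quick check that the recursion is wired correctly.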
“…The word "phone" was coined in [5] as an abbreviation for "phonetic symbol," defined in [6] as an element of a phonetic transcription corresponding to at most one phoneme, whose boundary times in the acoustic signal can be reliably identified using automatic forced alignment. Such language-dependent ASR segment inventories may be expressed using the language-independent symbols of the IPA [7], and their set union defines a language-independent phone inventory, which may be trained using multilingual data [8]; alternatively, language-dependent phone models may be trained using far less data than language-dependent word models, because the number of phones in a language is far fewer than the number of words [9]. In order to use phone-based acoustic models, however, it is necessary to discover the phone inventory of the unseen language.…”
Section: Introduction (mentioning)
confidence: 99%
“…The limited availability of speech corpora is a major constraint on the development of automatic speech recognition (ASR) in under-resourced languages and dialects [1,2]. Consequently, there is significant interest in ways to develop such corpora efficiently [3], and the efficient exploitation of limited corpora [1,4].…”
Section: Introduction (mentioning)
confidence: 99%