2019
DOI: 10.1109/access.2019.2931391
|View full text |Cite
|
Sign up to set email alerts
|

Investigation of Various Hybrid Acoustic Modeling Units via a Multitask Learning and Deep Neural Network Technique for LVCSR of the Low-Resource Language, Amharic

Abstract: Multitask learning (MTL) is helpful for improving the performance of related tasks when the training dataset is limited and sparse, especially for low-resource languages. Amharic is a lowresource language and suffers from the problems of training data scarcity, sparsity, and unevenness. Consequently, fundamental acoustic units-based speech recognizers perform worse compared with the speech recognizers of technologically favored languages. This paper presents the results of our contributions to the use of vario… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
6
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(7 citation statements)
references
References 37 publications
1
6
0
Order By: Relevance
“…Chen et al [91] took grapheme modeling as an additional learning task and used multi-task learning DNN to learn the phone model of the target language. Fantaye et al [92] studied to conduct joint training through multi-task learning for basic acoustic units of a lowresource language Amharic, including syllable, phone and rounded phone.…”
Section: ) Multitask Learningmentioning
confidence: 99%
“…Chen et al [91] took grapheme modeling as an additional learning task and used multi-task learning DNN to learn the phone model of the target language. Fantaye et al [92] studied to conduct joint training through multi-task learning for basic acoustic units of a lowresource language Amharic, including syllable, phone and rounded phone.…”
Section: ) Multitask Learningmentioning
confidence: 99%
“…The DNN model is an old feed-forward network that has been used by several researchers in low-resource speech recognitions. For example, Fantaye et al [7], Sercu et al [34], Sriranjani et al [8], and Chan W. and Lane I. [16] used DNN model for Amharic, Cebuano, Hindi, and Bengali languages, respectively.…”
Section: State-of-the-art Acoustic Modelsmentioning
confidence: 99%
“…Moreover, we used two Ethiopic languages, namely Amharic and Chaha, as additional target languages for evaluating our proposed models. Amharic (https://github.com/besacier/ALFFA_PUBLIC/tree/master/ASR/AMHARIC) is a low-resource language, which has only 26-h read speech training corpus [7]. Chaha (https://m.scirp.org/papers/97733) is a very low-resource language that has only 2.67-h read speech training corpus [42].…”
Section: Data Corpusmentioning
confidence: 99%
See 2 more Smart Citations