6th Workshop on Spoken Language Technologies for Under-Resourced Languages (SLTU 2018) 2018
DOI: 10.21437/sltu.2018-40
|View full text |Cite
|
Sign up to set email alerts
|

Assessing Performance of Bengali Speech Recognizers Under Real World Conditions using GMM-HMM and DNN based Methods

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 0 publications
0
2
0
Order By: Relevance
“…Impact of CS and other kinds of language switches on the ASR systems has recently received research interest, resulting in multiple approaches for CS acoustic and language modeling [7][8][9][10][11][12][13][14][15][16]. The main focus of the FAME!…”
Section: Introductionmentioning
confidence: 99%
“…Impact of CS and other kinds of language switches on the ASR systems has recently received research interest, resulting in multiple approaches for CS acoustic and language modeling [7][8][9][10][11][12][13][14][15][16]. The main focus of the FAME!…”
Section: Introductionmentioning
confidence: 99%
“…(ii) The next step is to learn an acoustic model, usually referred to as an encoder to associate these features to higher level speech or orthographic units (phonemes or graphemes). This is achieved either using Gaussian Mixture Models (GMM) (Khan et al, 2018) or a Deep Neural Network (DNN) (Hannun et al, 2014;Amodei et al, 2016;Li et al, 2019) with HMMs, where the GMM/DNN acts as a function approximator and the HMM adds context capabilities to the encoder. With advances in deep learning, Long Short Term Memory networks (LSTMs) have replaced both the GMM/DNN and HMM models, since LSTMs act as function approximators and also learn context-dependence.…”
Section: Asr Training and Resourcesmentioning
confidence: 99%