2019 3rd International Conference on Computing Methodologies and Communication (ICCMC) 2019
DOI: 10.1109/iccmc.2019.8819747
|View full text |Cite
|
Sign up to set email alerts
|

Comparison of acoustical models of GMM-HMM based for speech recognition in Hindi using PocketSphinx

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(7 citation statements)
references
References 15 publications
0
7
0
Order By: Relevance
“…Speech recognition technology begins with the recognition of a single phoneme instead of recognizing a continuous word [27]. The phoneme recognition in the state-of-the-art speech recognition model is done with the help of the Gaussian Mixer Model (GMM)-Hidden Markov Model (HMM)-Language Model (LM) paradigm [22,27]. In the GMM-HMM-LM paradigm, GMM will process input speech feature vector (i.e.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…Speech recognition technology begins with the recognition of a single phoneme instead of recognizing a continuous word [27]. The phoneme recognition in the state-of-the-art speech recognition model is done with the help of the Gaussian Mixer Model (GMM)-Hidden Markov Model (HMM)-Language Model (LM) paradigm [22,27]. In the GMM-HMM-LM paradigm, GMM will process input speech feature vector (i.e.…”
Section: Related Workmentioning
confidence: 99%
“…In the GMM-HMM-LM paradigm, GMM will process input speech feature vector (i.e. Mel Frequency Cepstral Coefficient (MFCC) [30]) and emits emission probability for HMM [5,22,27]. The HMM together with LM compute the most likely sequence of phoneme with the help of a decoder [6].…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…The acoustic model (AM) in an SR system creates the essential units of speech in the composed structure regarding a specific input signal [15]. The signal which acts as input is grafted up into overlapping periods of 10 ms with a 5 ms. At that point from each frame, 39 MFCC [1] co-efficient are extricated.…”
Section: Csr Acoustic Modelmentioning
confidence: 99%