2019
DOI: 10.1049/iet-spr.2018.5131
|View full text |Cite
|
Sign up to set email alerts
|

Recognition of isolated digits using DNN–HMM and harmonic noise model

Abstract: Speech recognition is an area that is constantly developing. In this study, the authors present a new system of speech recognition applied to the Arabic language. The system proposed here is based on the harmonic plus noise model (HNM). This model is rather used in speech synthesis tasks and is known for providing excellent speech production quality. Thus, their contribution lies in replacing the conventional mel-frequency cepstrum coefficients (MFCC) parameters with a set of acoustic parameters, extracted thr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(2 citation statements)
references
References 24 publications
0
2
0
Order By: Relevance
“…Clustering methods have acquired many successful applications in the past decades [1][2][3][4], such as K-Means [5] and spectral clustering [6]. Recently, deep clustering methods attract more attention [7][8][9][10][11][12], which transfers the original samples into a deep network to learn an abstract and discriminative feature representation. Different from the above traditional methods, deep clustering methods learn the in-depth information.…”
Section: Introductionmentioning
confidence: 99%
“…Clustering methods have acquired many successful applications in the past decades [1][2][3][4], such as K-Means [5] and spectral clustering [6]. Recently, deep clustering methods attract more attention [7][8][9][10][11][12], which transfers the original samples into a deep network to learn an abstract and discriminative feature representation. Different from the above traditional methods, deep clustering methods learn the in-depth information.…”
Section: Introductionmentioning
confidence: 99%
“…However, the MFCC method is limited to the frequency conversion relationship [16]. In order to make full use of MFCC method to extract vibration signal features, we need to improve its Mel frequency scale [17]. Pliwak proposed an adaptive frequency scale (SFCC) based on human voice to improve the MFCC feature extraction method [18], and achieved good results compared with the traditional MFCC feature extraction results.…”
Section: Introductionmentioning
confidence: 99%