2012
DOI: 10.1109/tasl.2011.2178597
|View full text |Cite
|
Sign up to set email alerts
|

Real-Time Robust Automatic Speech Recognition Using Compact Support Vector Machines

Abstract: Abstract-In the last years, support vector machines (SVMs) have shown excellent performance in many applications, especially in the presence of noise. In particular, SVMs offer several advantages over artificial neural networks (ANNs) that have attracted the attention of the speech processing community. Nevertheless, their high computational requirements prevent them from being used in practice in automatic speech recognition (ASR), where ANNs have proven to be successful. The high complexity of SVMs in this c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
14
0

Year Published

2012
2012
2022
2022

Publication Types

Select...
8
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 33 publications
(14 citation statements)
references
References 46 publications
0
14
0
Order By: Relevance
“…While training the system the training dataset (T D ) is divided according to different gender dependent the age group like M Y , M A , M S , F Y , F A , F S, and also Training dataset is divided as per emotions type is the speaker speaking in the happy, sadness, disgust, fear, surprise and neutral (S, H, D, S, F, N) emotions. V is the extracted feature's vector, using the feature vector by concatenating all V create the supervector feature presentation using the SVM and GMM model for each type speaker characteristic the template will do that for you [9].…”
Section: System Architecturementioning
confidence: 99%
“…While training the system the training dataset (T D ) is divided according to different gender dependent the age group like M Y , M A , M S , F Y , F A , F S, and also Training dataset is divided as per emotions type is the speaker speaking in the happy, sadness, disgust, fear, surprise and neutral (S, H, D, S, F, N) emotions. V is the extracted feature's vector, using the feature vector by concatenating all V create the supervector feature presentation using the SVM and GMM model for each type speaker characteristic the template will do that for you [9].…”
Section: System Architecturementioning
confidence: 99%
“…Initially DNN based system are used on phone recognition task. After that DNNs are used at large scale on big vocabulary continuous speech [7][8][9][10].When DNN based systems are compared with other systems like dynamic time warping (DTW), Hidden Markov Models (HMM), Gaussian mixture models (GMM) based systems, then recognition accuracy is more for DNN based systems [11].There is fast learning in deep neural network by parameterization of weight matrix by using periodic functions. In this way training time is reduced and classification accuracy improves.…”
Section: Introductionmentioning
confidence: 99%
“…Alternatively, it has been demonstrated that support vector machines (SVMs) provide an effective approach to avoid the aforementioned problems of ANNs, which are a new machine learning method for classification and regression based on statistical learning theory and structural risk minimization principle [16]- [20]. SVMs have been effectively applied in function estimation [21]- [23], fault diagnose [24], [25], data mining [26], [27], and speech recognition [28], [29]. Furthermore, least squares support vector machines (LS-SVMs) possess faster arithmetic speed compared to SVMs [30].…”
Section: Introductionmentioning
confidence: 99%