2018 International Conference on Communication and Signal Processing (ICCSP) 2018
DOI: 10.1109/iccsp.2018.8524451
|View full text |Cite
|
Sign up to set email alerts
|

A Study of Speech, Speaker and Emotion Recognition Using Mel Frequency Cepstrum Coefficients and Support Vector Machines

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 20 publications
(9 citation statements)
references
References 9 publications
0
9
0
Order By: Relevance
“…The SVM classifier that has variable kernel functions, is one of the widely used optimization-based classifiers. The kernel functions increase the sample space to a higher dimension so that data can be separated linearly and also the kernels reduce the computational complexity by increasing the dimension [6], [33], [49]. The nonlinear classifier, SVM calculates the boundary of the decision precisely for speaker-independent and small-sized sample applications [6].…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…The SVM classifier that has variable kernel functions, is one of the widely used optimization-based classifiers. The kernel functions increase the sample space to a higher dimension so that data can be separated linearly and also the kernels reduce the computational complexity by increasing the dimension [6], [33], [49]. The nonlinear classifier, SVM calculates the boundary of the decision precisely for speaker-independent and small-sized sample applications [6].…”
Section: Methodsmentioning
confidence: 99%
“…ML involves two processes, training and testing. The conventional classifiers are naive Bayes [6], decision trees (DT) [6], artificial neural networks (ANN) [8], hidden Markov model (HMM) [16], Gaussian mixture model (GMM) [24], extreme Learning machine (ELM) [25], knearest neighbor (KNN) [25], incomplete sparse least square regression [32], and support vector machines (SVM) [33]. Kmeans algorithm is an unsupervised clustering method; other classifiers may be used with in ensemble methods [34].…”
Section: Classification In the Ser Literaturementioning
confidence: 99%
See 1 more Smart Citation
“…MFCC is one of the most popular speech feature to be utilized for SER, as shown in research [1][2][3][4][5][6][7]. This common usage enables us to benchmark the results.…”
Section: Mel-frequency Cepstral Coefficients (Mfccs) Feature Extractionmentioning
confidence: 99%
“…The recent trend of performing SER is to employ machine learning for the system to learn the speech emotion feature patterns. There are many popular machine learning models such as support vector machine (SVM) [1][2][3], Hidden Markov Model (HMM) [4,5], or artificial neural networks in many forms such as convolutional neural networks (CNN) [6,7] or recurrent neural network (RNN) [8]. A neural network can be considered deep when it uses more than a single layer.…”
Section: Introductionmentioning
confidence: 99%