2013
DOI: 10.1186/1687-4722-2013-8
|View full text |Cite
|
Sign up to set email alerts
|

Evaluation of influence of spectral and prosodic features on GMM classification of Czech and Slovak emotional speech

Abstract: This article analyzes and compares influence of different types of spectral and prosodic features for Czech and Slovak emotional speech classification based on Gaussian mixture models (GMM). Influence of initial setting of parameters (number of mixture components and used number of iterations) for GMM training process was analyzed, too. Subsequently, analysis was performed to find how correctness of emotion classification depends on the number and the order of the parameters in the input feature vector and on … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 20 publications
(9 citation statements)
references
References 32 publications
0
9
0
Order By: Relevance
“…Unlike LDA, each word in an RSM is generated by multiple topics, leading to distributed and richer representations. Furthermore, the topics are inferred in a single pass via Equation 6, offering a significant reduction in computational complexity.…”
Section: Replicated Softmax Modelmentioning
confidence: 99%
See 1 more Smart Citation
“…Unlike LDA, each word in an RSM is generated by multiple topics, leading to distributed and richer representations. Furthermore, the topics are inferred in a single pass via Equation 6, offering a significant reduction in computational complexity.…”
Section: Replicated Softmax Modelmentioning
confidence: 99%
“…Segmental approaches operate directly on these frames using static Gaussian mixture models (GMM) [6], dynamic hidden Markov models (HMM) [7,8], or their variants [9]. Based on studies indicating the suprasegmental behavior of emotions [2,10], turn-level features have shown to significantly outperform HMM-based approaches.…”
Section: Introductionmentioning
confidence: 99%
“…For implementation of the supra-segmental parameters of speech, the statistical types-median values, range of values, std, and/or relative maximum and minimum were used in the feature vectors. The length of the input feature vector N f eat = 16 was experimentally chosen in correspondence with the obtained results of our previous research [16]. The three tested feature sets were: P1 consisting of the basic spectral features together with the supra-segmental parameters, P2 consisting of the supplementary spectral features and the supra-segmental parameters, and P3 being a mix of the basic and the supplementary spectral features with the prosodic parameters.…”
Section: Material Experiments and Resultsmentioning
confidence: 99%
“…The extended vector PN15 consists of the features with MUTI SF < N SFC and h = 1: En c0 , En r0 , HNR, S centr , S spread , S skew , S tilt, SFM, SHE, F0 DIFF , L ZCR , F ZCR , F 1 /F 2 , J abs , AP rel . According to the results published in [10], [20], the length of the input data vector for GMM training and testing was set to 16.…”
Section: Materials and Processing Conditionsmentioning
confidence: 99%