TENCON 2015 - 2015 IEEE Region 10 Conference 2015
DOI: 10.1109/tencon.2015.7372840
|View full text |Cite
|
Sign up to set email alerts
|

Progress in speech emotion recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
10
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 17 publications
(10 citation statements)
references
References 41 publications
0
10
0
Order By: Relevance
“…• Time-Based Features They are zero-crossing rate (ZCR) [8] and amplitudebased features, such as amplitude descriptor, log attack time, attach, delay, sustain, release envelop, short-time energy (STE) [9], shimmer [10], rhythm-based features [8], [11], volume, and temporal centroid [3].…”
Section: Acoustic Features In Ser Literaturementioning
confidence: 99%
See 1 more Smart Citation
“…• Time-Based Features They are zero-crossing rate (ZCR) [8] and amplitudebased features, such as amplitude descriptor, log attack time, attach, delay, sustain, release envelop, short-time energy (STE) [9], shimmer [10], rhythm-based features [8], [11], volume, and temporal centroid [3].…”
Section: Acoustic Features In Ser Literaturementioning
confidence: 99%
“…Spectral features that well simulate the vocal tract [7], [15]. The quality of the voice depends on the physical structure of the vocal tract and the glottal waveforms [8]. It includes HNR, jitter, shimmer, normalized amplitude quotient, quasi open quotient, parabolic spectral parameter, and maxima dispersion quotient [7], [15].…”
Section: Acoustic Features In Ser Literaturementioning
confidence: 99%
“…Most of the modern investigations report speech emotion classification rates of 70-90% [4,45,54]. Particular results depend on the analyzed language, the number of emotions, the speaker mode, and other important factors.…”
Section: Introductionmentioning
confidence: 99%
“…Speaker emotion recognition is based on the emotional feature analysis. The most important speech emotion features include speech quality feature, spectral feature and prosodic feature [1,2]. We cannot achieve good results using only one of these types of features.…”
Section: Introductionmentioning
confidence: 99%
“…emotion feature number). Then each utterance feature ijn P ( n is the emotion sample index) is normalized Calculate the feature dispersion for certain emotion1 The feature contribution and the Euclidean distance are summed ( ) each feature is taken as the sum weight and it is simple enough for KNN classification. The inner relation between features is modeled and the performance of emotion classification is improved.…”
mentioning
confidence: 99%