Interspeech 2013 2013
DOI: 10.21437/interspeech.2013-20
|View full text |Cite
|
Sign up to set email alerts
|

Analysis of emotional speech at subsegmental level

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
5
3
1

Relationship

4
5

Authors

Journals

citations
Cited by 21 publications
(5 citation statements)
references
References 12 publications
0
5
0
Order By: Relevance
“…These features were shown to be useful for discriminating phonation types in speech and singing [13], [41], [42]. SoE was shown to be proportional to the rate of glottal closure, the EoE feature was shown to capture the vocal effort, and the loudness measure was shown to capture the abruptness of the glottal closure [43], [44]. The energy of the ZFF signal at glottal closure is also used as a feature, it was shown to capture low frequency energy [13].…”
Section: ) Zero Frequency Filtering (Zff)mentioning
confidence: 99%
“…These features were shown to be useful for discriminating phonation types in speech and singing [13], [41], [42]. SoE was shown to be proportional to the rate of glottal closure, the EoE feature was shown to capture the vocal effort, and the loudness measure was shown to capture the abruptness of the glottal closure [43], [44]. The energy of the ZFF signal at glottal closure is also used as a feature, it was shown to capture low frequency energy [13].…”
Section: ) Zero Frequency Filtering (Zff)mentioning
confidence: 99%
“…This feature was used in the analysis and classification of vocal emotions in [58,59], where the feature was shown to be large for emotions of low arousal (such as sadness), and small for emotions of high arousal (such as anger and happiness). Therefore, this feature reflects changes in the relative duration of the glottal closed phase in a similar manner to CQ and NAQ [16,60].…”
Section: Features Derived Using Zffmentioning
confidence: 99%
“…where 2K+1 corresponds to the number of samples in the 1-ms window. This feature was shown to reflect the changes in vocal effort [58,59]. The experiments in [58,59] indicated that EoE was generally large for emotions of high arousal and small for emotions of low arousal.…”
Section: Features Derived Using Zffmentioning
confidence: 99%
“…The fundamental frequency (F0 = 1/T0) is a popular acoustic feature that can be estimated from speech using pitch analysis methods, but other source parameters that are correlated with emotions can also be measured from speech, e.g. the strength of excitation and energy of excitation [2,3], aperiodicity features (jitter, shimmer, etc.) [4,5] and energy ratio between different frequency bands [6].…”
Section: Introductionmentioning
confidence: 99%