DOI: 10.1007/978-3-540-69731-2_84
|View full text |Cite
|
Sign up to set email alerts
|

Emotion Recognition with Poincare Mapping of Voiced-Speech Segments of Utterances

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(7 citation statements)
references
References 13 publications
0
7
0
Order By: Relevance
“…The Polish Polish Emotional Speech Database (PESD) [ 2 ] was prepared and shared by the Medical Electronics Division, Lodz University of Technology. The database consists of 240 samples recorded in the aula of the Polish National Film Television and Theater School in Lodz.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…The Polish Polish Emotional Speech Database (PESD) [ 2 ] was prepared and shared by the Medical Electronics Division, Lodz University of Technology. The database consists of 240 samples recorded in the aula of the Polish National Film Television and Theater School in Lodz.…”
Section: Methodsmentioning
confidence: 99%
“…Proper identification of emotional state can significantly improve quality of human-computer interfaces. It can be applied for monitoring of psycho-physiological states of individuals e.g., to assess the level of stress or fatigue, forensic data analysis [ 2 ], advertisement [ 3 ], social robotic [ 4 ], video conferencing [ 5 ], violence detection [ 6 ], animation or synthesis of life-like agents xue2018voice, and many others. Automatic emotion recognition methods utilize various input types i.e., facial expressions [ 7 , 8 , 9 ], speech [ 10 , 11 , 12 ], gesture and body language [ 13 , 14 ], physical signals such as electrocardiogram (ECG), electromyography (EMG), electrodermal activity, skin temperature, galvanic resistance, blood volume pulse (BVP), and respiration [ 15 ].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Although this approach requires a short training phase, nevertheless the test phase involves high computational cost. DT is required to high memory and higher error rates occurs if number of samples is low [19].…”
Section: B Classification Algorithmsmentioning
confidence: 99%
“…Since many speech recognition systems are trained solely on neutrally pronounced speech, they can not efficiently cope with phonetic changes caused by altered psychical or physical state of the speaker. ASR performance relation to various stress conditions and emotional states of a user has received an increasing amount of interest in the last decade [6]- [9]. Other sources of variability may arise from speaker's fatigue level [10], [11], various illnesses, alcohol or drug intoxication, etc.…”
Section: Introductionmentioning
confidence: 99%