IberSPEECH 2018 2018
DOI: 10.21437/iberspeech.2018-4
|View full text |Cite
|
Sign up to set email alerts
|

Speaker Recognition under Stress Conditions

Abstract: Speaker Recognition systems exhibit a decrease in performance when the input speech is not in optimal circumstances, for example when the user is under emotional or stress conditions. The objective of this paper is measuring the effects of stress on speech to ultimately try to mitigate its consequences on a speaker recognition task. On this paper, we develop a stress-robust speaker identification system using data selection and augmentation by means of the manipulation of the original speech utterances. An ext… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
2
1

Relationship

2
1

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 0 publications
0
3
0
Order By: Relevance
“…The audio recorded is then stored in an only-read Cloud with several encrypting processes and accessibility restrictions. Regarding speech, there are two main tasks to be performed within Bindi, Stress Detection [7] and Speaker Identification [1]. The former focuses in the detection of stress in the victim's voice whereas the latter relies on the proper identification of the victim despite the emotional conditions that may be present in her voice.…”
Section: Uc3m4safety and Bindimentioning
confidence: 99%
See 1 more Smart Citation
“…The audio recorded is then stored in an only-read Cloud with several encrypting processes and accessibility restrictions. Regarding speech, there are two main tasks to be performed within Bindi, Stress Detection [7] and Speaker Identification [1]. The former focuses in the detection of stress in the victim's voice whereas the latter relies on the proper identification of the victim despite the emotional conditions that may be present in her voice.…”
Section: Uc3m4safety and Bindimentioning
confidence: 99%
“…We aim at finding techniques to improve speaker identification systems when facing stressed speech, either by neutralizing the effects of stress or by training the system to cope with it. We propose data augmentation techniques both statistical and using synthetically generated speech under stressed conditions together with an analysis of the best feature extraction methods to design a stress-robust system [1].…”
Section: Introductionmentioning
confidence: 99%
“…In our previous works ( [19], [20]) we explored data augmentation techniques where we created synthetic stressed speech by modifying its pitch and speed. This increased the robustness of the SR system to the distortions caused by the stressed speech signals, achieving a 20% of relative improvement in accuracy.…”
Section: Introductionmentioning
confidence: 99%