2022
DOI: 10.3390/e24030414
|View full text |Cite
|
Sign up to set email alerts
|

Speaker Recognition Using Constrained Convolutional Neural Networks in Emotional Speech

Abstract: Speaker recognition is an important classification task, which can be solved using several approaches. Although building a speaker recognition model on a closed set of speakers under neutral speaking conditions is a well-researched task and there are solutions that provide excellent performance, the classification accuracy of developed models significantly decreases when applying them to emotional speech or in the presence of interference. Furthermore, deep models may require a large number of parameters, so c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
3

Relationship

1
5

Authors

Journals

citations
Cited by 13 publications
(1 citation statement)
references
References 32 publications
0
1
0
Order By: Relevance
“…Unlike the method proposed in [29], we do not utilize MFCC (Mel frequency cepstral coefficients) as inputs. Instead, we adopt the approach of generating spectrograms using short-time Fourier transform (STFT), following the methodology described in [30].…”
Section: Audiovisual Emotion Recognition Modelmentioning
confidence: 99%
“…Unlike the method proposed in [29], we do not utilize MFCC (Mel frequency cepstral coefficients) as inputs. Instead, we adopt the approach of generating spectrograms using short-time Fourier transform (STFT), following the methodology described in [30].…”
Section: Audiovisual Emotion Recognition Modelmentioning
confidence: 99%