2018
DOI: 10.1007/s00521-018-3760-2
|View full text |Cite
|
Sign up to set email alerts
|

Novel cascaded Gaussian mixture model-deep neural network classifier for speaker identification in emotional talking environments

Abstract: This research is an effort to present an effective approach to enhance text-independent speaker identification performance in emotional talking environments based on novel classifier called cascaded Gaussian Mixture Model-Deep Neural Network (GMM-DNN). Our current work focuses on proposing, implementing and evaluating a new approach for speaker identification in emotional talking environments based on cascaded Gaussian Mixture Model-Deep Neural Network as a classifier. The results point out that the cascaded G… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
30
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
2
2

Relationship

2
6

Authors

Journals

citations
Cited by 50 publications
(30 citation statements)
references
References 29 publications
0
30
0
Order By: Relevance
“…There is indeed a very small body of work on SI for Arabic especially in an emotional state as shown in Table 1. Table 8 presents a comparison between our CRNN model and Shahin's models [28], [30], [44], [45], and [60] to identify a speaker in a specific emotion. Table 9 presents another comparison between the proposed system and other previous studies in an emotional environment.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…There is indeed a very small body of work on SI for Arabic especially in an emotional state as shown in Table 1. Table 8 presents a comparison between our CRNN model and Shahin's models [28], [30], [44], [45], and [60] to identify a speaker in a specific emotion. Table 9 presents another comparison between the proposed system and other previous studies in an emotional environment.…”
Section: Resultsmentioning
confidence: 99%
“…Table 1 presents a summary of Arabic speaker recognition from 2010 to 2019 in different environments of neutral and emotional speech, as well as shouting. Many remarkable studies were reported by Shahin et al [6], [28]- [30] with the goal of developing SV systems using different classifiers and targeting Emirati-accented speakers. Shahin et al [30] proposed a novel classifier called the cascaded GMM deep neural network (GMM-DNN) to enhance text-independent SI performance using two corpora: the Emirati speech database and ''speech under simulated and actual stress'' English database.…”
Section: Literature Reviewmentioning
confidence: 99%
See 1 more Smart Citation
“…In a recent study by Shahin et.al [7], they proposed an efficient methodology to improve "text-independent speaker identification accuracy in emotional talking environments based on a novel classifier called cascaded Gaussian Mixture Model-Deep Neural Network (GMM-DNN). Specifically, their work focused on proposing, implementing, and assessing a new framework for speaker identification in emotional talking environments based on cascaded Gaussian Mixture Model-Deep Neural Network as a classifier.…”
Section: Introduction and Literature Reviewmentioning
confidence: 99%
“…In emotional/stressful talking environments, Shahin et al [5] enhanced "text-independent speaker identification accuracy in emotional talking environments based on novel classifier named cascaded Gaussian Mixture Model-Deep Neural Network (GMM-DNN). Their work focused on proposing, applying, and testing a new framework for speaker identification in such talking environments based on sequential Gaussian Mixture Model followed by Deep Neural Network as a classifier.…”
Section: Introduction and Literature Reviewmentioning
confidence: 99%