2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2016
DOI: 10.1109/icassp.2016.7472649
|View full text |Cite
|
Sign up to set email alerts
|

Analysis of DNN approaches to speaker identification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

1
71
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 67 publications
(72 citation statements)
references
References 11 publications
1
71
0
Order By: Relevance
“…So, Deep Neural Network (DNN) has also a perfect and successful entry into the field of audio signal processing. DNN was introduced into the field of speaker identification as a successor of Automatic Speech Recognition (ASR) which was a comprehensive success [19].…”
Section: Deep Neural Networkmentioning
confidence: 99%
See 2 more Smart Citations
“…So, Deep Neural Network (DNN) has also a perfect and successful entry into the field of audio signal processing. DNN was introduced into the field of speaker identification as a successor of Automatic Speech Recognition (ASR) which was a comprehensive success [19].…”
Section: Deep Neural Networkmentioning
confidence: 99%
“…Ali et.al[22] studied the use of features from distinct levels of Deep Belief Network (DBN) to quantize the audio data into vectors of "audio-word counts".Table 6demonstrates speaker identification performance based on each of GMM-DNN and the three different prior work using the ESD. This table yields 5.8%, 3.9%, and 5.6% improvement rate in speaker identification performance based on the proposed classifier over that based on DNN-BN[19], single DNN[21], and DBN[22], respectively. Hence, GMM-DNN offers a robust and computationally efficient novel classification technique for "speaker identification in emotional environments".…”
mentioning
confidence: 93%
See 1 more Smart Citation
“…For speaker-related tasks, uncertainty in features can be represented by several speaker models, among which Vector Quantization (VQ), Gaussian mixture models (GMMs) [1] and i-vector [2] are the most successful examples proposed in the past decades. Recently, deep Neural Networks (DNNs), especially Convolution Neural Networks (CNNs) also have been widely and successfully applied to extract deep features to represent speakers [3], [4], [5].…”
Section: Introductionmentioning
confidence: 99%
“…Various neural network-based approaches were proposed in [18], without considering different noise and handset conditions. Furthermore, other researchers have employed deep neural network (DNN) analysis for speaker identification [19]. In [20], the authors selected 100 speakers from the TIMIT and self-collected databases using novel fuzzy vector quantization (NFVQ) techniques to enhance the speaker identification system (SIS).…”
Section: Introductionmentioning
confidence: 99%