Interspeech 2015 2015
DOI: 10.21437/interspeech.2015-297
|View full text |Cite
|
Sign up to set email alerts
|

Locally-connected and convolutional neural networks for small footprint speaker recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
38
0
2

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 51 publications
(40 citation statements)
references
References 9 publications
0
38
0
2
Order By: Relevance
“…We can classify d-vector based SV systems according to the loss function used. The first one is based on the softmax loss defined in [23] as the combination of a cross-entropy loss, a softmax function and the last fully connected layer [7,8,24]. In this system, a speaker classifier is trained to classify speakers in the training set.…”
Section: D-vector Systemsmentioning
confidence: 99%
See 1 more Smart Citation
“…We can classify d-vector based SV systems according to the loss function used. The first one is based on the softmax loss defined in [23] as the combination of a cross-entropy loss, a softmax function and the last fully connected layer [7,8,24]. In this system, a speaker classifier is trained to classify speakers in the training set.…”
Section: D-vector Systemsmentioning
confidence: 99%
“…Another deep learning-based approach is to extract speaker embeddings directly from a speaker discriminative network [7][8][9][10][11]. In such systems, the network is trained to classify speakers in the training set, or to separate same-speaker and different-speaker utterance pairs.…”
Section: Introductionmentioning
confidence: 99%
“…In our work we focused on text independent speaker verification [1,2]. Deep learning based speaker verification systems [3,4,5,6,7,8] are getting popular this days and such systems have improved the performance of speaker verification systems.…”
Section: Introductionmentioning
confidence: 99%
“…In the speaker verification field, deep neural networks (DNNs) have been used as speaker embedding extractors. Generally, a speaker embedding-based speaker verification system executes the following process [1][2][3][4]:…”
Section: Introductionmentioning
confidence: 99%