2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2018
DOI: 10.1109/icassp.2018.8462575
|View full text |Cite
|
Sign up to set email alerts
|

A Complete End-to-End Speaker Verification System Using Deep Neural Networks: From Raw Signals to Verification Result

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
47
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3
1

Relationship

3
5

Authors

Journals

citations
Cited by 66 publications
(47 citation statements)
references
References 10 publications
0
47
0
Order By: Relevance
“…The b-vector system was first introduced for scoring the i-vectors. Then, it has been shown that the b-vector based system could be efficient classifier of embeddings from DNNs in speaker verification task [6,30]. In our experiments, the b-vector based system comprises three fully-connected lay- ers, each with 512 nodes, and an output layer with two nodes.…”
Section: Additional Validation Using Voxceleb2mentioning
confidence: 99%
“…The b-vector system was first introduced for scoring the i-vectors. Then, it has been shown that the b-vector based system could be efficient classifier of embeddings from DNNs in speaker verification task [6,30]. In our experiments, the b-vector based system comprises three fully-connected lay- ers, each with 512 nodes, and an output layer with two nodes.…”
Section: Additional Validation Using Voxceleb2mentioning
confidence: 99%
“…We propose a model (referred to as "RawNet" for convenience) that is an improvement of the CNN-LSTM model in [5,7] by changing architectural details (Section 2.1. ), proposing a modified pre-training scheme (Section 2.2.…”
Section: Front-end: Rawnetmentioning
confidence: 99%
“…It includes a number of modifications to the CNN-LSTM model in [5,7], allowing for further improvement. First, activation functions are changed from rectified linear units (ReLU) to leaky ReLU.…”
Section: Model Architecturementioning
confidence: 99%
See 1 more Smart Citation
“…The DNN used in this study comprises convolutional neural networks (CNNs), gated recurrent units (GRUs) and fully connected layers (CNN-GRU) as used in [15][16][17]. In this architecture, input features are first processed using convolutional layers to extract frame-level embeddings.…”
Section: End-to-end Dnnmentioning
confidence: 99%