2020 23rd International Conference on Computer and Information Technology (ICCIT) 2020
DOI: 10.1109/iccit51783.2020.9392734
|View full text |Cite
|
Sign up to set email alerts
|

A Hybrid GRU-CNN Feature Extraction Technique for Speaker Identification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 31 publications
0
2
0
Order By: Relevance
“…However, these nonlinear features cannot be used as a stand-alone feature to achieve a performance that is anywhere close to what state-of-the-art techniques (using the linear feature) achieve. These recent techniques are based on deep learning architectures such as Convolutional Neural Networks (CNN) 24 , Wav2Vec2.0 25 , Deeper Feature CNN-Connectionist Temporal Classification (DFCNN-CTC) 26 , Gated Recurrence Unit-CNN (GRU-CNN) 27 etc.…”
Section: Nonlinear Dynamics Of Vocal Tract For Speaker Modelingmentioning
confidence: 99%
“…However, these nonlinear features cannot be used as a stand-alone feature to achieve a performance that is anywhere close to what state-of-the-art techniques (using the linear feature) achieve. These recent techniques are based on deep learning architectures such as Convolutional Neural Networks (CNN) 24 , Wav2Vec2.0 25 , Deeper Feature CNN-Connectionist Temporal Classification (DFCNN-CTC) 26 , Gated Recurrence Unit-CNN (GRU-CNN) 27 etc.…”
Section: Nonlinear Dynamics Of Vocal Tract For Speaker Modelingmentioning
confidence: 99%
“…Novel methods, algorithms, and more modern applications are currently being developed and improved for the segmentation of speech signals and the calculation of parametric indicators of selected fragments, thereby creating a spectrogram of speech signals using spectral analysis [ 4 , 5 , 6 , 7 ]. Speaker identification with diversified voice clips across the globe is a crucial and challenging task, especially in extracting vigorous and discriminative features [ 8 ]. A novel Mel Frequency Cepstral Coefficients (MFCC) feature extraction system that is quicker and more energy efficient than the traditional MFCC realization was proposed by Korkmaz et al [ 9 ].…”
Section: Related Workmentioning
confidence: 99%