2019 International Conference on Advances in Big Data, Computing and Data Communication Systems (icABCD) 2019
DOI: 10.1109/icabcd.2019.8851018
|View full text |Cite
|
Sign up to set email alerts
|

The Effects of Data Size on Text-Independent Automatic Speaker Identification System

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
3
2
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(3 citation statements)
references
References 31 publications
0
3
0
Order By: Relevance
“…Pitch frequency is a perceptual characteristic of the speech signal with physical properties denoted by F0 and is used to improve the performance of speaker identification [23,24]. Many studies use these features for speaker identification not only separately but also in combination [25][26][27][28]. In this paper, six different feature extraction approaches, namely Mel Frequency Cepstral Coefficients (MFCC)+Pitch, Gammatone Cepstral Coefficients (GTCC)+Pitch, MFCC+GTCC+Pitch+eight spectral features, spectrograms, i-vectors, and Alexnet feature vectors were used.…”
Section: Related Workmentioning
confidence: 99%
“…Pitch frequency is a perceptual characteristic of the speech signal with physical properties denoted by F0 and is used to improve the performance of speaker identification [23,24]. Many studies use these features for speaker identification not only separately but also in combination [25][26][27][28]. In this paper, six different feature extraction approaches, namely Mel Frequency Cepstral Coefficients (MFCC)+Pitch, Gammatone Cepstral Coefficients (GTCC)+Pitch, MFCC+GTCC+Pitch+eight spectral features, spectrograms, i-vectors, and Alexnet feature vectors were used.…”
Section: Related Workmentioning
confidence: 99%
“…E ( ω ), the excitation spectrum, corresponds to high spectral variations (details) predominantly found in high quefrency, while H ( ω ), the vocal tract, accounts for low spectral variations (envelope) present at low quefrency. Evidently, research has validated the information-rich nature of the speech spectrum envelope compared to its details [ 25 ].…”
Section: Feature Extraction Stagesmentioning
confidence: 99%
“…Within this context, MFCCs emerge as the preferred choice due to their superior alignment with the human auditory system response [ 25 ]. This alignment is achieved through the Mel-scale, which takes into consideration the frequency bands of the auditory system.…”
Section: Feature Extraction Stagesmentioning
confidence: 99%