2016
DOI: 10.1007/s11042-016-3335-0
|View full text |Cite
|
Sign up to set email alerts
|

Spectral-temporal receptive fields and MFCC balanced feature extraction for robust speaker recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
5
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
5
5

Relationship

0
10

Authors

Journals

citations
Cited by 22 publications
(5 citation statements)
references
References 19 publications
0
5
0
Order By: Relevance
“…Pitch-synchronous SFF spectrogram can be used in various other applications such as speaker identification [18], [36], speaker verification [39], audio classification [41] etc. The SFF output has high SNR values of speech in the time-frequency domain.…”
Section: Discussionmentioning
confidence: 99%
“…Pitch-synchronous SFF spectrogram can be used in various other applications such as speaker identification [18], [36], speaker verification [39], audio classification [41] etc. The SFF output has high SNR values of speech in the time-frequency domain.…”
Section: Discussionmentioning
confidence: 99%
“…To evaluate the performance of the proposed speaker recognition system, experiments on 36speaker recognition were conducted. Comparing with the MFCC baseline, the proposed feature set increases the speaker recognition rates by 3.85 % and 18.49 % on clean and noisy speeches, respectively (11) .…”
Section: Introductionmentioning
confidence: 95%
“…Several studies have been conducted using both conventional machine learning and deep learning methods to enhance the accuracy of speaker recognition systems under noisy and mismatched conditions. Some of the conventional machine learning methods include the Gaussian mixture model (GMM) [14] and support vector machine (SVM) [15]. These methods have been using handcrafted features for speaker recognition and other speech analysis purposes.…”
Section: Introductionmentioning
confidence: 99%