2021
DOI: 10.1007/978-981-15-8443-5_76
|View full text |Cite
|
Sign up to set email alerts
|

Constant Q Cepstral Coefficients and Long Short-Term Memory Model-Based Automatic Speaker Verification System

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
1

Relationship

2
6

Authors

Journals

citations
Cited by 10 publications
(4 citation statements)
references
References 21 publications
0
4
0
Order By: Relevance
“…Todisco et al [28] used CQCC feature extraction technique with two GMM, a binary classifier which was used to classify audios as genuine or spoof. Mittal et al used CQCC with CNN, LSTM, and a combination of the staticdynamic features of CQCC with LSTM-CNN ensemble in their proposed works of [3], [29]and [30], respectively. However, the issue of noise remains open with MFCC and CQCC features.…”
Section: A Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Todisco et al [28] used CQCC feature extraction technique with two GMM, a binary classifier which was used to classify audios as genuine or spoof. Mittal et al used CQCC with CNN, LSTM, and a combination of the staticdynamic features of CQCC with LSTM-CNN ensemble in their proposed works of [3], [29]and [30], respectively. However, the issue of noise remains open with MFCC and CQCC features.…”
Section: A Related Workmentioning
confidence: 99%
“…Motivated by the works of [20], [13] and [3], [29], [30], the proposed work in this paper suggests a novel integrated approach for front end feature extraction. The proposed approach uses combination of audio and image feature extraction techniques.…”
Section: A Related Workmentioning
confidence: 99%
“…Mittal et al [33] utilized CQCC at the front-end and employed two back-end models, CNN and LSTM, both individually and in combination as LSTM-CNN in their proposed work [34,35] and [33,36], respectively. Khochare et al [37] proposed a model for detecting deep fake audios, and for implementing this work fake or real datasets [38] have been used.…”
Section: Literature and Contributionmentioning
confidence: 99%
“…In various speech and speaker recognition tasks, LSTM-based deep learning mod-els are performing better than the other models. However, CNN models are also giving satisfactory results [31][32][33]. Also, different arrangements of frontend and backend models can bring smoothness and accuracy to spoof detection task.…”
Section: Introductionmentioning
confidence: 99%