2021
DOI: 10.1007/s40747-021-00565-w
|View full text |Cite
|
Sign up to set email alerts
|

Static–dynamic features and hybrid deep learning models based spoof detection system for ASV

Abstract: Detection of spoof is essential for improving the performance of current scenario of Automatic Speaker Verification (ASV) systems. Empowerment to both frontend and backend parts can build the robust ASV systems. First, this paper discuses performance comparison of static and static–dynamic Constant Q Cepstral Coefficients (CQCC) frontend features by using Long Short Term Memory (LSTM) with Time Distributed Wrappers model at the backend. Second, it performs comparative analysis of ASV systems built using three … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 17 publications
(4 citation statements)
references
References 37 publications
0
4
0
Order By: Relevance
“…Mittal et al [33] utilized CQCC at the front-end and employed two back-end models, CNN and LSTM, both individually and in combination as LSTM-CNN in their proposed work [34,35] and [33,36], respectively. Khochare et al [37] proposed a model for detecting deep fake audios, and for implementing this work fake or real datasets [38] have been used.…”
Section: Literature and Contributionmentioning
confidence: 99%
“…Mittal et al [33] utilized CQCC at the front-end and employed two back-end models, CNN and LSTM, both individually and in combination as LSTM-CNN in their proposed work [34,35] and [33,36], respectively. Khochare et al [37] proposed a model for detecting deep fake audios, and for implementing this work fake or real datasets [38] have been used.…”
Section: Literature and Contributionmentioning
confidence: 99%
“…In contrast, a network combining three dense layers and three LSTM layers with MFCC features performed well, with 2.91% EER. Mittal and Dua [ 51 ] presented a hybrid deep CNN using static and dynamic CQCC features sets. Hybrid CNN combined the CNN-LSTM model with a time-distributed wrapper integrated into the LSTM network.…”
Section: Related Workmentioning
confidence: 99%
“…Meng et al [31] proposed an anti-spoofing measure for smart home systems called ARRAYID which detects the passive liveness that uses the collated speech to distinguish between a live human and the replayed speech. Mittal and Dua [32], explored the deep learning models, CNN and LSTM, and the CQCC spectral feature. Two levels were used to detect spoofs.…”
Section: Literature Reviewmentioning
confidence: 99%