2017 International Joint Conference on Neural Networks (IJCNN) 2017
DOI: 10.1109/ijcnn.2017.7966294
|View full text |Cite
|
Sign up to set email alerts
|

On the use of deep recurrent neural networks for detecting audio spoofing attacks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 12 publications
(4 citation statements)
references
References 27 publications
0
4
0
Order By: Relevance
“…Additionally, some of the audio spoof detection methods have been extended by working on the features which are fed into the network (Balamurali et al, 2019 ). While others have changed the networks used or have improved both networks and features (Scardapane et al, 2017 ; Alzantot et al, 2019 ; Chintha et al, 2020 ; Rahul et al, 2020 ; Wang et al, 2020b ; Luo A. et al, 2021 ). Given the fact that one of the most important deepfake detection challenges is “generalization,” researchers are highly recommended to work on generalization by changing or improving both of the networks and features as well as defining different loss functions (Chen T. et al, 2020 ; Zhang Y. et al, 2021 ).…”
Section: Discussion and Future Directionsmentioning
confidence: 99%
See 1 more Smart Citation
“…Additionally, some of the audio spoof detection methods have been extended by working on the features which are fed into the network (Balamurali et al, 2019 ). While others have changed the networks used or have improved both networks and features (Scardapane et al, 2017 ; Alzantot et al, 2019 ; Chintha et al, 2020 ; Rahul et al, 2020 ; Wang et al, 2020b ; Luo A. et al, 2021 ). Given the fact that one of the most important deepfake detection challenges is “generalization,” researchers are highly recommended to work on generalization by changing or improving both of the networks and features as well as defining different loss functions (Chen T. et al, 2020 ; Zhang Y. et al, 2021 ).…”
Section: Discussion and Future Directionsmentioning
confidence: 99%
“…In addition, some of the audio spoof detection systems have been extended by working on the features which are fed into the network (Balamurali et al, 2019 ). While others have worked on the networks used or both of the networks and features (Scardapane et al, 2017 ; Alzantot et al, 2019 ; Chintha et al, 2020 ; Rahul et al, 2020 ; Wang et al, 2020b ; Luo A. et al, 2021 ). Therefore, besides the modeling phase, the features which are fed to the models are really challenging in the field of audio deepfake.…”
Section: Deepfake Categoriesmentioning
confidence: 99%
“…Recurrent Neural Networks (RNN) have significant advantages in dealing with time-dependent problems, utilizing gated structures and recurrent units to maintain the memory of time-series-related issues. Scardapane et al used an RNN network model to extract acoustic features from the ASVspoof2015 dataset and conducted experiments on forged audio, achieving an average EER of 2.91% [30]. Gomez-Alanis et al combined CNN and RNN networks to establish a CNN-RNN hybrid network model for detecting noisy forged audio [31].…”
Section: Related Workmentioning
confidence: 99%
“…Analysis of the deep RNN network was presented by Scardapane et al [ 50 ]. They evaluated four architectures with MFCC features, log-filter bank features, and a concatenation of these two feature sets using ASVSpoof 2015 datasets.…”
Section: Related Workmentioning
confidence: 99%