2017 International Conference on Wireless Communications, Signal Processing and Networking (WiSPNET) 2017
DOI: 10.1109/wispnet.2017.8300031
|View full text |Cite
|
Sign up to set email alerts
|

Mitigating effects of noise in Forensic Speaker Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
5
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(5 citation statements)
references
References 18 publications
0
5
0
Order By: Relevance
“…To determine the performance of the MFCC dual-channel and MFCC single-channel methods by calculating the values of true positive (TP), true negative (TN), false positive (FP), false negative (FN) [7]. TP is a sample of words stating true, and the test results are identical.…”
Section: š·(š‘„mentioning
confidence: 99%
See 2 more Smart Citations
“…To determine the performance of the MFCC dual-channel and MFCC single-channel methods by calculating the values of true positive (TP), true negative (TN), false positive (FP), false negative (FN) [7]. TP is a sample of words stating true, and the test results are identical.…”
Section: š·(š‘„mentioning
confidence: 99%
“…Audio tapping by the authorities is an activity to record without being noticed. Wiretapping recordings are not necessarily good quality, because they can't choose the environmental situation when the tapping takes place [6], [7]. Noisy environment, wiretapping recording is noise, the audio recording quality is low [7].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Human-computer interaction increasingly relies on Machine Learning (ML) models such as Deep Neural Networks (DNNs) trained from, usually large, datasets [1,2,3,4]. The ubiquitous applications of DNNs in security-critical tasks, such as face identity recognition [5,6], speaker verification [7,8], voice controlled systems [9,10,11] or signal forensics [12,13,14,15] require a high reliability on these computational models. However, it has been demonstrated that such models can be fooled by perturbing an input sample with malicious and quasi-imperceptible perturbations.…”
Section: Introductionmentioning
confidence: 99%
“…Forensic speaker recognition in noise for mitigating effects is discussed in [8]. In preprocessing stage voice activity detector is used to achieve the robustness.…”
Section: Introductionmentioning
confidence: 99%