2018 International Conference on Signals and Systems (ICSigSys) 2018
DOI: 10.1109/icsigsys.2018.8372769
|View full text |Cite
|
Sign up to set email alerts
|

LP residual features to counter replay attacks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
2
2
2

Relationship

0
6

Authors

Journals

citations
Cited by 8 publications
(7 citation statements)
references
References 11 publications
0
7
0
Order By: Relevance
“…Proper choice of LP order eliminates the formant information from the residual spectrum and contains mostly excitation source information. If f s is the sampling frequency then the order of prediction p is given by f s +2 [25]. Therefore p is set to 18.…”
Section: A Lp Residualmentioning
confidence: 99%
See 1 more Smart Citation
“…Proper choice of LP order eliminates the formant information from the residual spectrum and contains mostly excitation source information. If f s is the sampling frequency then the order of prediction p is given by f s +2 [25]. Therefore p is set to 18.…”
Section: A Lp Residualmentioning
confidence: 99%
“…MFCC contain vocal tract information and R-MFCC contain excitation source information about the speaker. To compute R-MFCC first, the log magnitude spectrum of residual signal is computed and the resultant is wrapped with mel filter [25]. Then the cepstral coefficients are obtained by applying inverse discrete fourier transform.…”
Section: E R-mfccmentioning
confidence: 99%
“…In recent studies [21], [25], [27], the magnitude of LPR spectra via the discrete Fourier transform (DFT) has been proven to be powerful for replay attack detection. This is because the magnitude of replayed LPR spectra is highly affected by the distortion of recording and playback devices, which is different from the magnitude of genuine LPR spectra.…”
Section: The Motivation For Using the Phase Information Based On A Linear Prediction Analysis-based Speech Signalmentioning
confidence: 99%
“…In all these features, the decision for detecting genuine speech from replay speech is strongly based on the magnitude spectrum information of short-time spectral analysis obtained from original/raw speech. However, the magnitude spectrum information of original/raw speech via short-time spectral analysis is weak for detecting replayed signals as modern recording, and playback devices can maintain the average spectral patterns of original speech in replayed signals [21]. Thus, a source of alternative information for feature extraction is required to counter the replayed signal.…”
Section: Introductionmentioning
confidence: 99%
“…In Yu, Tan, Ma, Martin, and Guo (2017) authors use different types of filter banks such as linear, Gammatone, and its inverted version and also inverted Mel filter banks to extract different variations of cepstral coefficients for replay attack detection. Furthermore, recently, linear prediction residual-based features like linear prediction residual magnitude cepstral coefficients (Hanilç, 2018), linear prediction residual phase cepstral coefficients (Hanilç, 2018), LP-based relative phase features (Phapatanaburi, Wang, Nakagawa, & Iwahashi, 2019), and residual Mel frequency cepstral coefficients (Mishra, Singh, & Pati, 2018), have shown interesting and promising results in detection of spoofed speech.…”
mentioning
confidence: 99%