2010
DOI: 10.1504/ijbm.2010.035450
|View full text |Cite
|
Sign up to set email alerts
|

On the use of perceptual Line Spectral pairs Frequencies and higher-order residual moments for Speaker Identification

Abstract: Conventional Speaker Identification (SI) systems utilise spectral features like Mel-Frequency Cepstral Coefficients (MFCC) or PerceptualLinear Prediction (PLP) as a frontend module. Line Spectral pairs Frequencies (LSF) are popular alternative representation of Linear Prediction Coefficients (LPC). In this paper, an investigation is carried out to extract LSF from perceptually modified speech. A new feature set extracted from the residual signal is also proposed. SI system based on this residual feature contai… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2012
2012
2021
2021

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(4 citation statements)
references
References 43 publications
(54 reference statements)
0
4
0
Order By: Relevance
“…The results of the research may be generalized to a new finding that one efficient parameter (here the spectral slope) derived from a suitable subband of smoothed long-term spectrum is sufficient to successfully discriminate against speakers. When recognizing speakers and having long utterances available, the long-term speech spectrum can complement the traditional short-term voice features such as pitch [13], mel-frequency cepstral coefficients [11], line spectral pair frequencies [19], etc. and so help to improve recognition systems.…”
Section: Discussionmentioning
confidence: 99%
“…The results of the research may be generalized to a new finding that one efficient parameter (here the spectral slope) derived from a suitable subband of smoothed long-term spectrum is sufficient to successfully discriminate against speakers. When recognizing speakers and having long utterances available, the long-term speech spectrum can complement the traditional short-term voice features such as pitch [13], mel-frequency cepstral coefficients [11], line spectral pair frequencies [19], etc. and so help to improve recognition systems.…”
Section: Discussionmentioning
confidence: 99%
“…The LSP parameters are expressed as the zeroes (or roots) of P ( z ) and Q ( z ). The zeroes uniquely determine P ( z ), Q ( z ), and A ( z ) can be made up of P ( z ) and Q ( z ) (Sahidullah, Chakroborty, & Saha, 2010). A()z=12()0.25emP()z+Q()z0.25em0.25em …”
Section: The Features Based On Linear Prediction Analysismentioning
confidence: 99%
“…Line Spectral Pairs (LSP) are popular alternative representation of Linear Prediction Coefficients (LPC). LSPs are useful for speech coding as they have some properties that make them superior to direct quantization of LPCs [23].…”
Section: Line Spectral Frequency (Lsp)mentioning
confidence: 99%