2008
DOI: 10.1109/icassp.2008.4518736
|View full text |Cite
|
Sign up to set email alerts
|

Voice source cepstrum coefficients for speaker identification

Abstract: We propose a novel feature set for speaker recognition that is based on the voice source signal. The feature extraction process uses closed-phase LPC analysis to estimate the vocal tract transfer function. The LPC spectrum envelope is converted to cepstrum coefficients which are used to derive the voice source features. Unlike approaches based on inverse-filtering, our procedure is robust to LPC analysis errors and low-frequency phase distortion. We have performed text-independent closed-set speaker identifica… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
43
0

Year Published

2011
2011
2023
2023

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 52 publications
(45 citation statements)
references
References 21 publications
2
43
0
Order By: Relevance
“…This is in concurrence with studies that compare vocal tract and voice source features both for other tasks [18,19] and cognitive workload [22]. Never the less the performance of the vocal tract parameters can be improved by fusing the two parameters at the output level.…”
Section: Discussionsupporting
confidence: 64%
See 1 more Smart Citation
“…This is in concurrence with studies that compare vocal tract and voice source features both for other tasks [18,19] and cognitive workload [22]. Never the less the performance of the vocal tract parameters can be improved by fusing the two parameters at the output level.…”
Section: Discussionsupporting
confidence: 64%
“…Voice source features can be extracted indirectly from the speech signal via covariance analysis and cepstrum processing without relying on inverse filtering [18,19]. Various methods for inverse filtering have however been implemented and evaluated [20] and feature extraction methods based on the glottal flow estimate have been proposed [21,22,23].…”
Section: Introductionmentioning
confidence: 99%
“…Also, baseline (modal) values of voice quality measurements such as NAQ vary significantly across speakers [28]. Spontaneous, continuous speech, typical of conversations, has voice quality characteristics that differ significantly enough across speakers that they may be used as features for automatic speaker identification systems [50,51]. For these reasons, in order to measure voice quality of a given speech segment, the measure must be normalized for the underlying speaker variation regarding age, mood, conversational context, fatigue, and other factors.…”
Section: Acoustic Measures Of Stress and Voice Qualitymentioning
confidence: 99%
“…2 The VS estimate is also used for analyzing pathological voices 3 and features extracted from its shape for speaker identification (SID). [4][5][6] Further, studies like Ref. 7 show that the VS pulse shape influences the perceived voice quality.…”
Section: Introductionmentioning
confidence: 99%