Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005.
DOI: 10.1109/icassp.2005.1415192
|View full text |Cite
|
Sign up to set email alerts
|

Advances In Channel Compensation For SVM Speaker Recognition

Abstract: Cross-channel degradation is one of the significant challenges facing speaker recognition systems. We study the problem for speaker recognition using support vector machines (SVMs). We perform channel compensation in SVM modeling by removing non-speaker nuisance dimensions in the SVM expansion space via projections. Training to remove these dimensions is accomplished via an eigenvalue problem. The eigenvalue problem attempts to reduce multisession variation for the same speaker, reduce different channel effect… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
163
0

Publication Types

Select...
4
4
2

Relationship

0
10

Authors

Journals

citations
Cited by 212 publications
(163 citation statements)
references
References 4 publications
0
163
0
Order By: Relevance
“…The problem of session variability compensation can be then addressed estimating the principal directions of channel variability in the development phase, and later canceling them out in the test phase, a process known as Nuissance Attribute Projection (NAP) (Solomonoff, Campbell, & Boardman, 2005). If we compute for every speaker in the development set the mean supervector (one per speaker), and every utterance is normalized subtracting its mean speaker vector, the resulting data set (known as within-scatter matrix) with all normalized utterances contains only session variability and no speaker information.…”
Section: High-dimensionality Spectral Systemsmentioning
confidence: 99%
“…The problem of session variability compensation can be then addressed estimating the principal directions of channel variability in the development phase, and later canceling them out in the test phase, a process known as Nuissance Attribute Projection (NAP) (Solomonoff, Campbell, & Boardman, 2005). If we compute for every speaker in the development set the mean supervector (one per speaker), and every utterance is normalized subtracting its mean speaker vector, the resulting data set (known as within-scatter matrix) with all normalized utterances contains only session variability and no speaker information.…”
Section: High-dimensionality Spectral Systemsmentioning
confidence: 99%
“…Nuisance Attribute Projection (NAP) is a powerful technique traditionally used in the field of speaker recognition for compensation of channel effects regardless of its source [10,9], which are assumed to lie in a low dimensional variability subspace. In others fields like biometrics at a distance and unconstrained environments, the variability sources are mostly unknown and mixed, hence, we seek to understand to what extent variability compensation techniques as NAP are useful.…”
Section: Nuisance Attribute Projection (Nap)mentioning
confidence: 99%
“…Since the speaker related information is buried under others, raw i-vectors are not sufficiently discriminative. In order to improve the discriminative capability of i-vectors, various discriminative models have been proposed, including WCCN [6], NAP [7], LDA [8] and PLDA [9]. Among these models, PLDA is regarded as the most effective approach and delivers state-of-art performance.…”
Section: Introductionmentioning
confidence: 99%