2008
DOI: 10.1109/tasl.2008.925147
|View full text |Cite
|
Sign up to set email alerts
|

A Study of Interspeaker Variability in Speaker Verification

Abstract: Abstract-We propose a new approach to the problem of estimating the hyperparameters which define the inter-speaker variability model in joint factor analysis. We tested the proposed estimation technique on the NIST 2006 speaker recognition evaluation data and obtained 10-15% reductions in error rates on the core condition and the extended data condition (as measured both by equal error rates and the NIST detection cost function). We show that when a large joint factor analysis model is trained in this way and … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

1
369
0
5

Year Published

2011
2011
2016
2016

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 532 publications
(375 citation statements)
references
References 26 publications
1
369
0
5
Order By: Relevance
“…The i-vectors are also length-normalised [7]. Details about training of total variability space matrix T can be seen in [13] or [14]. Because of the differences between each conversation (and the similarity in one conversation), we also compute a conversation dependent PCA transformation, which further reduces the dimensionality of the i-vector w. The dimension of the PCA latent space is dependent on the parameter p, the ratio of eigenvalue mass [8] (in our case p = 0.5).…”
Section: I-vectorsmentioning
confidence: 99%
“…The i-vectors are also length-normalised [7]. Details about training of total variability space matrix T can be seen in [13] or [14]. Because of the differences between each conversation (and the similarity in one conversation), we also compute a conversation dependent PCA transformation, which further reduces the dimensionality of the i-vector w. The dimension of the PCA latent space is dependent on the parameter p, the ratio of eigenvalue mass [8] (in our case p = 0.5).…”
Section: I-vectorsmentioning
confidence: 99%
“…Kenny [13] provides a more detailed explanation of the derivation of these parameters, using the EM algorithm.…”
Section: I-vector Extractionmentioning
confidence: 99%
“…The i-vector is a compact representation (typically from 400 to 600 dimensions) of a whole utterance, derived as a point estimate of the latent variables in a factor analysis model [12] [13]. However, while proven to be successful in a variety of scenarios, i-vector based approaches suffer from two major drawbacks when coping with real-time applications.…”
Section: Introductionmentioning
confidence: 99%
“…Factor analysis has led to the development of an effective method of compensating for intersession variability in speaker verification. Joint Factor Analysis (JFA) [3], [4] and total variability factor analysis [5], [6] have been successfully applied in speaker verification.…”
Section: Introductionmentioning
confidence: 99%
“…The techniques of intersession compensation of LDA and WCCN are briefly introduced in Sect. 4. Experiments and results are presented in Sect.…”
Section: Introductionmentioning
confidence: 99%