2007
DOI: 10.1109/tasl.2007.901823
|View full text |Cite
|
Sign up to set email alerts
|

Compensation of Nuisance Factors for Speaker and Language Recognition

Abstract: The variability of the channel and environment is one of the most important factors affecting the performance of text-independent speaker verification systems. The best techniques for channel compensation are model based. Most of them have been proposed for Gaussian Mixture Models, while in the feature domain blind channel compensation is usually performed. The aim of this work is to explore techniques that allow more accurate intersession compensation in the feature domain. Compensating the features rather th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
48
0

Year Published

2008
2008
2018
2018

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 75 publications
(49 citation statements)
references
References 25 publications
1
48
0
Order By: Relevance
“…It turns out that, in the case of a diagonal factor analysis model, using speaker-dependent GMM's does indeed produce better results. For example, on the English language trials in the core condition, a diagonal model with 100 channel factors produces an EER of 2.8% for male speakers, which is similar to the results presented in [30], [5], [31] (but not as good as the result in the first line of Table V).…”
Section: G Note On Baum-welch Statisticssupporting
confidence: 76%
See 2 more Smart Citations
“…It turns out that, in the case of a diagonal factor analysis model, using speaker-dependent GMM's does indeed produce better results. For example, on the English language trials in the core condition, a diagonal model with 100 channel factors produces an EER of 2.8% for male speakers, which is similar to the results presented in [30], [5], [31] (but not as good as the result in the first line of Table V).…”
Section: G Note On Baum-welch Statisticssupporting
confidence: 76%
“…The results we have obtained using speaker factors are clearly much better than those obtained using d alone, but the reader may have noticed that the figures presented in the fourth rows of Tables I and II are not quite as good as the best results that have been reported with comparable standalone GMM/UBM systems as in [30], [5], [31]. These systems are comparable because they use relevance MAP for speaker enrollment and channel factors to compensate for intersession variability.…”
Section: G Note On Baum-welch Statisticsmentioning
confidence: 65%
See 1 more Smart Citation
“…Most of the approaches based on FA for language recognition are implemented with a single U matrix because the segments are well-delimited (typically in separated files) and the nature of the withinclass variability is similar for all the languages as it can be seen in [36,[46][47][48]. In [49], a segmentation system was proposed with five class location vectors (one vector per class) and a single compensation matrix U for all the classes.…”
Section: Class Model Vs Alternative Model U Matricesmentioning
confidence: 99%
“…The factor analysis [6,7,8] has been found to be very effective in reducing the channel bias in the Gaussian mixture models and universal background models (GMM-UBM) system [6]. In this paper, we use factor analysis to reduce channel bias.…”
Section: Introductionmentioning
confidence: 99%