2006 IEEE Odyssey - The Speaker and Language Recognition Workshop 2006
DOI: 10.1109/odyssey.2006.248116
|View full text |Cite
|
Sign up to set email alerts
|

How to Deal with Multiple-Targets in Speaker Identification Systems?

Abstract: In open-set speaker identification systems a known phenomenon is that the false alarm (accept) error rate increases dramatically when increasing the number of registered speakers (models). In this paper, we demonstrate this phenomenon and suggest a solution using a new modeldependent score-normalization technique, called Top-norm. The Top-norm method was specifically developed to improve results of open-set speaker identification systems. Also, we suggest a score-normalization parameter adaptation technique. E… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
28
0

Year Published

2008
2008
2022
2022

Publication Types

Select...
4
3
2

Relationship

3
6

Authors

Journals

citations
Cited by 22 publications
(28 citation statements)
references
References 6 publications
0
28
0
Order By: Relevance
“…When T 60 = 0. 5 of n seem to be small. However, CMS performance may not be explained entirely by MDR values.…”
Section: V-c the Effect Of Mdr On Svrmentioning
confidence: 84%
See 1 more Smart Citation
“…When T 60 = 0. 5 of n seem to be small. However, CMS performance may not be explained entirely by MDR values.…”
Section: V-c the Effect Of Mdr On Svrmentioning
confidence: 84%
“…The presence of reverberation adds distortion to the feature vectors, which results in performance degradation of SVR and thus of speaker recognition, due to mismatched conditions between trained models and test segments [3]. Feature normalization techniques such as cepstral mean subtraction (CMS) and score normalization techniques such as the Znorm, Hnorm, Tnorm [4] and Top-norm [5] may be used to reduce the effect of reverberation which is characterized by a short-duration room impulse response (RIR). However, they might not be as useful to reduce the effect of long-duration RIR, which is often the case in room acoustics [6].…”
Section: Introductionmentioning
confidence: 99%
“…Multitarget detection such as blacklist or watchlist was often described as open-set speaker identification. There are a few relevant studies [2,3,4,5,6,7], but it is not actively being explored because it is regarded as a special case of speaker verification. Most research on this topic pre-date the i-vector [8], so it is difficult to compare the performance of older blacklist detection systems with state-of-the-art technology.…”
Section: Introductionmentioning
confidence: 99%
“…The presence of reverberation adds distortion to the feature vectors, which results in performance degradation of SVR systems due to mismatched conditions between trained models and test segments. Feature normalization techniques such as the cepstral mean subtraction (CMS) [Mammone et al, 1996] and variance normalization [Chen & Bilmes, 2007], and score normalization techniques such as the Znorm, Hnorm, Tnorm [Bimbot et al, 2004, Mammone et al, 1996 and Top-norm [Zigel & Wasserblat, 2006], were originally developed to compensate for the effect of a telephone channel [Mammone et al, 1996], or for the effect of slowly varying convolutive noises in general [Reynolds et al, 2000]. For that reason, these techniques may be used to reduce the effect of reverberation, if it is characterized by a short-duration room impulse response (RIR).…”
Section: Introductionmentioning
confidence: 99%