Interspeech 2015 2015
DOI: 10.21437/interspeech.2015-701
|View full text |Cite
|
Sign up to set email alerts
|

Uncertainty propagation for noise robust speaker recognition: the case of NIST-SRE

Abstract: Uncertainty propagation is an established approach to handle noisy and reverberant conditions in automatic speech recognition (ASR), but it has little been studied for speaker recognition so far. Yu et al. recently proposed to propagate uncertainty to the Baum-Welch (BW) statistics without changing the posterior probability of each mixture component. They obtained good results on a small dataset (YOHO) but little improvement on the NIST-SRE dataset, despite the use of oracle uncertainty estimates. In this pape… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
10
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(10 citation statements)
references
References 31 publications
0
10
0
Order By: Relevance
“…However, these approaches focused on the issue of computing representations with in-sufficient data, caused by utterances with different, possibly short durations [9][10][11][12][13][14][15]. This paper pursues the same line as our preliminary study [8], that considered an epistemic uncertainty propagation approach for noise-robust text-independent speaker verification using a system based on i-vectors [16] and probabilistic linear discriminant analysis (PLDA) [17]. Despite the recent introduction of deep learning based modules in the speaker recognition pipeline, reports of the last NIST Speaker Recognition Evaluation (SRE) in 2016 [18] and the experience in the recent campaign NIST-SRE 2018 1 evidenced that the i-vector-PLDA approach still performs among the best systems of the state-of-the-art.…”
Section: Introductionmentioning
confidence: 85%
See 3 more Smart Citations
“…However, these approaches focused on the issue of computing representations with in-sufficient data, caused by utterances with different, possibly short durations [9][10][11][12][13][14][15]. This paper pursues the same line as our preliminary study [8], that considered an epistemic uncertainty propagation approach for noise-robust text-independent speaker verification using a system based on i-vectors [16] and probabilistic linear discriminant analysis (PLDA) [17]. Despite the recent introduction of deep learning based modules in the speaker recognition pipeline, reports of the last NIST Speaker Recognition Evaluation (SRE) in 2016 [18] and the experience in the recent campaign NIST-SRE 2018 1 evidenced that the i-vector-PLDA approach still performs among the best systems of the state-of-the-art.…”
Section: Introductionmentioning
confidence: 85%
“…To the best of our knowledge, the studies in [7,8] are the only ones exploiting this model for noise and reverberation robustness in i-vector based speaker recognition systems, while other works focused on earlier, now deprecated systems. They proposed two different ways to propagate the uncertainty from the enhanced features to the i-vectors.…”
Section: Uncertainty Propagation To the I-vectormentioning
confidence: 99%
See 2 more Smart Citations
“…The authors in [25], [26] speed-up the uncertainty propagation method by grouping i-vectors together based on their reliability and by finding a representative posterior covariance matrix for each group. In [27], the authors incorporate the uncertainty associated with front-end features into the i-vector extraction framework. Finally, in [28], an extension of uncertainty decoding using simplified PLDA scoring and modified imputation is proposed.…”
Section: B Modelling I-vector Uncertaintymentioning
confidence: 99%