Interspeech 2016 2016
DOI: 10.21437/interspeech.2016-572
|View full text |Cite
|
Sign up to set email alerts
|

Iterative PLDA Adaptation for Speaker Diarization

Abstract: This paper investigates iterative PLDA adaptation for crossshow speaker diarization applied to small collections of French TV archives based on an i-vector framework. Using the target collection itself for unsupervised adaptation, PLDA parameters are iteratively tuned while score normalization is applied for convergence. Performances are compared, using combinations of target and external data for training and adaptation. The experiments on two distinct target corpora show that the proposed framework can gradu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
9
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
4
4

Relationship

2
6

Authors

Journals

citations
Cited by 11 publications
(9 citation statements)
references
References 14 publications
0
9
0
Order By: Relevance
“…In [21], the authors used HAC for speaker linking, but our preliminary experiments with TR scoring using HAC gave inconclusive results. As shown in the lower part of Figure 5, HAC gives erratic variations in terms of DER depending on the number of epochs, for BF M .…”
Section: Diarization Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…In [21], the authors used HAC for speaker linking, but our preliminary experiments with TR scoring using HAC gave inconclusive results. As shown in the lower part of Figure 5, HAC gives erratic variations in terms of DER depending on the number of epochs, for BF M .…”
Section: Diarization Resultsmentioning
confidence: 99%
“…Contrary to the speaker diarization and linking framework of [21] where PLDA is used to compute similarities between ivectors, we decide to replace PLDA by a Neural Network approach. The method is inspired by that of [15] and [16], where the triplet loss [17] is used to train a neural network embedding, which aims at separating faces or speakers.…”
Section: Triplet Ranking Frameworkmentioning
confidence: 99%
“…Therefore, we recommend an informative AHC initialization method, similar to our previous paper [51]. After using PLDA to compute the log likelihood ratio between two segment i-vectors [34,35], AHC is applied to perform clustering. Using the AHC results, two prior calculation methods, hard prior and soft prior, are proposed [51].…”
Section: Ahc Initializationmentioning
confidence: 99%
“…To address it, we recommend a more robust and informative AHC initialization method. After using PLDA to compute the log likelihood ratio between two segment i-vectors [18,19], AHC is applied to get the clustering results. Based on the AHC results, two prior calculation methods, hard prior and soft prior, are proposed.…”
Section: Ahc Initializationmentioning
confidence: 99%