The Speaker and Language Recognition Workshop (Odyssey 2016) 2016
DOI: 10.21437/odyssey.2016-22
|View full text |Cite
|
Sign up to set email alerts
|

First investigations on self trained speaker diarization

Abstract: This paper investigates self trained cross-show speaker diarization applied to collections of French TV archives, based on an i-vector/PLDA framework. The parameters used for i-vectors extraction and PLDA scoring are trained in a unsupervised way, using the data of the collection itself. Performances are compared, using combinations of target data and external data for training. The experimental results on two distinct target corpora show that using data from the corpora themselves to perform unsupervised iter… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2016
2016
2019
2019

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(7 citation statements)
references
References 9 publications
0
7
0
Order By: Relevance
“…When α is too small, the DER quickly reaches a plateau and the system stops improving after one iteration. This was observed in [11], where adaptation was performed with concatenation of train and target data. This is equivalent to setting α as the ratio between the two dataset, in this case, close to 0.1.…”
Section: Iterative Plda Adaptationmentioning
confidence: 90%
See 4 more Smart Citations
“…When α is too small, the DER quickly reaches a plateau and the system stops improving after one iteration. This was observed in [11], where adaptation was performed with concatenation of train and target data. This is equivalent to setting α as the ratio between the two dataset, in this case, close to 0.1.…”
Section: Iterative Plda Adaptationmentioning
confidence: 90%
“…In this paper, the target collections are too small to be solely used for unsupervised estimation of PLDA parameters [11], the use of a sufficient external labeled train corpus is mandatory to estimate those parameters. Figure 1 represents the overview of the diarization framework, including the two possible strategies for the training process: the bootstrap system (baseline) is represented with blue plain lines while the blue dashed lines correspond to the proposed adapted system.…”
Section: Iterative Adaptation Of Plda Parametersmentioning
confidence: 99%
See 3 more Smart Citations