2012 IEEE Spoken Language Technology Workshop (SLT) 2012
DOI: 10.1109/slt.2012.6424236
|View full text |Cite
|
Sign up to set email alerts
|

Speaker diarization and linking of large corpora

Abstract: Performing speaker diarization of a collection of recordings, where speakers are uniquely identified across the database, is a challenging task. In this context, inter-session variability compensation and reasonable computation times are essential to be addressed. In this paper we propose a two-stage system composed of speaker diarization and speaker linking modules that are able to perform data set wide speaker diarization and that handle both large volumes of data and inter-session variability compensation. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
19
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 27 publications
(19 citation statements)
references
References 10 publications
0
19
0
Order By: Relevance
“…ILP is also integrated into the open-source toolbox for broadcast news diarization LIUM [32]. In the case of the diarization of a big collection of recordings, the clusters which are defined after applying diarization separately in each recording can be used to perform a two-stage clustering approach, and compress the information [37].…”
Section: Speaker Diarizationmentioning
confidence: 99%
“…ILP is also integrated into the open-source toolbox for broadcast news diarization LIUM [32]. In the case of the diarization of a big collection of recordings, the clusters which are defined after applying diarization separately in each recording can be used to perform a two-stage clustering approach, and compress the information [37].…”
Section: Speaker Diarizationmentioning
confidence: 99%
“…For the annotation of speaker identities in the raw broadcast data, one viable approach is to link the speakers [37,38] appearing in the reference data with known identities to the (pseudo) speaker labels provided by a speaker diarization system. Speaker linking is achieved by clustering all speakers appearing in the reference and raw broadcast data based on the speaker similarity scores assigned by a speaker recognition (SR) system [39].…”
Section: Introductionmentioning
confidence: 99%
“…Application domains of cross-show diarization include radio and TV [1,2,3,4,5], phone [6,7,8,9] or meeting recordings [10]. The state-of-the-art systems based on i-vector/PLDA framework require speaker annotated datasets including speaker segments and identities.…”
Section: Introductionmentioning
confidence: 99%