Interspeech 2019 2019
DOI: 10.21437/interspeech.2019-2716
|View full text |Cite
|
Sign up to set email alerts
|

LEAP Diarization System for the Second DIHARD Challenge

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
13
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2

Relationship

3
4

Authors

Journals

citations
Cited by 11 publications
(13 citation statements)
references
References 12 publications
0
13
0
Order By: Relevance
“…The diarization baseline is based on LEAP Lab's submission to DIHARD II [27]. The system performs diarization by dividing each recording into short overlapping segments, extracting x-vectors [28,29], scoring with probabilistic linear discriminant analysis (PLDA) [30], and clustering using agglomerative hierarchical clustering (AHC) [31].…”
Section: Diarizationmentioning
confidence: 99%
See 2 more Smart Citations
“…The diarization baseline is based on LEAP Lab's submission to DIHARD II [27]. The system performs diarization by dividing each recording into short overlapping segments, extracting x-vectors [28,29], scoring with probabilistic linear discriminant analysis (PLDA) [30], and clustering using agglomerative hierarchical clustering (AHC) [31].…”
Section: Diarizationmentioning
confidence: 99%
“…The system performs diarization by dividing each recording into short overlapping segments, extracting x-vectors [28,29], scoring with probabilistic linear discriminant analysis (PLDA) [30], and clustering using agglomerative hierarchical clustering (AHC) [31]. The AHC ouput is then refined using Variational Bayes Hidden Markov Model (VB-HMM) [32,33] with posterior scaling [27]. The trained models and recipes for both tracks are distributed through GitHub 5 .…”
Section: Diarizationmentioning
confidence: 99%
See 1 more Smart Citation
“…The proposed approach also shows advancements over other published results on these datasets. Furthermore, the proposed approach can be used as an initialization for frame-level refinement based on variational Bayes (VB) hidden Markov model (HMM) [26], [27].…”
Section: Ti ( Ti )mentioning
confidence: 99%
“…In the end-to-end learning, the input features are fed to a model where the loss is either permutation-invariant cross entropy [39], [40] or clustering based [41]. Further to refine the boundaries of segmentation output in speaker diarization, a second re-segmentation step involving frame-level (20-30ms) modeling [26], [27] can be performed. This model the variational-Bayes HMM model which is typically initialized using a segment level diarization output [42].…”
Section: Related Workmentioning
confidence: 99%