2008
DOI: 10.1109/tasl.2008.2002085
|View full text |Cite
|
Sign up to set email alerts
|

Strategies to Improve the Robustness of Agglomerative Hierarchical Clustering Under Data Source Variation for Speaker Diarization

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
40
0

Year Published

2011
2011
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 52 publications
(40 citation statements)
references
References 16 publications
0
40
0
Order By: Relevance
“…In order to group similar speech utterances together, we performed a single Gaussian based bottom-up agglomerative hierarchical clustering (AHC) proposed by Han et al (2008) with K-means post refinement, using generalized likelihood ratio (GLR) distance. The smoothness constraint was then enforced by an ad-hoc scheme of jointly classifying all utterances inside a cluster using a majority voting rule.…”
Section: Joint Classification Of Test Samplesmentioning
confidence: 99%
“…In order to group similar speech utterances together, we performed a single Gaussian based bottom-up agglomerative hierarchical clustering (AHC) proposed by Han et al (2008) with K-means post refinement, using generalized likelihood ratio (GLR) distance. The smoothness constraint was then enforced by an ad-hoc scheme of jointly classifying all utterances inside a cluster using a majority voting rule.…”
Section: Joint Classification Of Test Samplesmentioning
confidence: 99%
“…The bottom-up approach is the most popular and has achieved general success in the NIST RT evaluations [13]- [17] and on evaluation data in other domains [18], [19]. Nonetheless, some authors report that instabilities related to initialization [20], model merging and the sensitivity of the stopping criterion [21] might degrade its performance.…”
Section: A Bottom-upmentioning
confidence: 99%
“…In our experience progressive training leads to significant improvements in performance over a conventional AHC system. Cluster merging is controlled with the modified Information Change Rate (ICR) criterion [21] and continues until the stopping criterion is met. In contrast to the exemplary, state-of-the-art system presented in [28], we find that the stopping criterion gives better results than the criterion as used in [28] and [29].…”
Section: B Bottom-upmentioning
confidence: 99%
“…In [13] oracle experiments were used to analyze existing and new stopping criteria. Both the speech/non-speech classification and speaker change detection components were replaced by oracle components and only the clustering and stopping components were tested.…”
Section: A Oracle Based Experiments In Other Studiesmentioning
confidence: 99%