2007
DOI: 10.1109/tasl.2007.894525
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Speaker Clustering Using a Voice Characteristic Reference Space and Maximum Purity Estimation

Abstract: This paper investigates the problem of automatically grouping unknown speech utterances based on their associated speakers. In attempts to determine which utterances should be grouped together, it is necessary to measure the voice similarities between utterances. Since most existing methods measure the inter-utterance similarities based directly on the spectrum-based features, the resulting clusters may not be well-related to speakers, but to various acoustic classes instead.This study remedies this shortcomin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2008
2008
2022
2022

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 17 publications
(7 citation statements)
references
References 20 publications
0
7
0
Order By: Relevance
“…where is the mixture occupation probability of Gaussian component at time : (21) Applying (15) to the first two Gaussian terms in (19), and , we have…”
Section: B Derivation Of a Closed Form Solutionmentioning
confidence: 99%
See 3 more Smart Citations
“…where is the mixture occupation probability of Gaussian component at time : (21) Applying (15) to the first two Gaussian terms in (19), and , we have…”
Section: B Derivation Of a Closed Form Solutionmentioning
confidence: 99%
“…We compared our model with two well-known similarity metrics: the Generalized Likelihood Ratio (GLR) and the Cross-Likelihood Ratio (CLR). The Generalized Likelihood Ratio (GLR) [21] between some utterance and some utterance is defined as (53) where is a single set of model parameters trained on both and merged together. The Cross-Likelihood Ratio [16] is (54) A previous study used either GMMs or a VQ model [16] for the parameters and , but here, we simply used the speaker factor vector obtained via ML-estimation [19] to build an adapted GMM for each utterance via (5).…”
Section: Speaker Clustering Experimentsmentioning
confidence: 99%
See 2 more Smart Citations
“…Recently, Tsai et al [6] presented an automatic speaker clustering algorithm using a voice characteristic reference space and maximum purity estimation, with the aim of maximizing the similarities between utterances within clusters. Tang et al [7] applied a complete treatment for a partially supervised speaker clustering to assist the unsupervised speaker clustering process.…”
Section: Introductionmentioning
confidence: 99%