Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing
DOI: 10.1109/icassp.1994.389309
|View full text |Cite
|
Sign up to set email alerts
|

Tree-structured speaker clustering for fast speaker adaptation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
21
0

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 39 publications
(21 citation statements)
references
References 8 publications
0
21
0
Order By: Relevance
“…In this experiment, we used a top-down clustering method based on Bhattacharyya distance [17], [23]. Of the constructed tree-structure, the root node is identical to MC-HMMs and the leaf node is the same as single-noise HMMs.…”
Section: Comparison With Other Noise Adaptation Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…In this experiment, we used a top-down clustering method based on Bhattacharyya distance [17], [23]. Of the constructed tree-structure, the root node is identical to MC-HMMs and the leaf node is the same as single-noise HMMs.…”
Section: Comparison With Other Noise Adaptation Methodsmentioning
confidence: 99%
“…Another noise adaptation approach is based on tree-structured clustering method [21]. In this method, treestructured clustering [17] is performed on various noise and signal-to-noise ratio (SNR) conditions. Then, based on the ML criterion, the HMM that best matches the input speech was selected by tracing the tree from top to bottom.…”
Section: Introductionmentioning
confidence: 99%
“…Relation (12) implies that the longer the distance between the means m i (m) and m j (m) and the smaller the variances σ 2 i (m) and σ 2 j (m), the higher the value of U ij . The coefficient U ij expresses numerically the law on classification abilities of probability distributions, known from detection theory.…”
Section: Calculation Of Distance D (2)mentioning
confidence: 99%
“…Naito et al (2002) proposed a clustering method based on vocal tract parameters extracted from the signal. Kosaka and Sagayama (1994) proposed the hierarchical clustering (HC) based on probability distances obtained from hidden Markov networks. On the very top of this structure, there is a model encompassing all the speakers, while individual models can be found at the very bottom.…”
Section: Introductionmentioning
confidence: 99%
“…Popular measures are the Bhattacharyya distance between output probabilities [42] and the probability of generating one speaker's data from another speaker's model after clustering [66]. Yoshizawa et al used sufficient statistics to measure the distance [26], [96].…”
Section: Speaker Clusteringmentioning
confidence: 99%