2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2011
DOI: 10.1109/icassp.2011.5947361
|View full text |Cite
|
Sign up to set email alerts
|

An utterance comparison model for speaker clustering using factor analysis

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2012
2012
2017
2017

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(5 citation statements)
references
References 7 publications
0
5
0
Order By: Relevance
“…Another method is to simply let and in (55) include these dropped utterances while ignoring the dropped utterances in the computation of , hence reflecting the errors in . In this work, we chose the latter method (in our previous work [20], we used the former method, which is why the absolute performance figures differ; also note that in this work, we used the MACROPHONE corpus for training, which is also different).…”
Section: Speaker Clustering Experimentsmentioning
confidence: 96%
See 1 more Smart Citation
“…Another method is to simply let and in (55) include these dropped utterances while ignoring the dropped utterances in the computation of , hence reflecting the errors in . In this work, we chose the latter method (in our previous work [20], we used the former method, which is why the absolute performance figures differ; also note that in this work, we used the MACROPHONE corpus for training, which is also different).…”
Section: Speaker Clustering Experimentsmentioning
confidence: 96%
“…In Section III, a closed-form solution of the model is derived by incorporating speaker factors. We include additional details to more clearly explain the derivation that was done in [20]. Next, in Section IV, the solution is extended by adding channel factors, which results in a final model that is identical in form to the initial model but with an alternate set of sufficient statistics that is inclusive of the initial set of statistics.…”
Section: Introductionmentioning
confidence: 98%
“…Statistical methods such as Bayesian Information Criterion (e.g., [ 92 ]), Generalized Likelihood Ratio (e.g., [ 93 ]), Kullback-Leibler divergence (e.g., [ 94 ]), and Information Change Rate (e.g., [ 95 ]) are used for both segmenting and clustering the speech. Lastly, machine learning methods such as Variational Bayes (e.g., [ 96 ]), Non-parametric Bayes (e.g., [ 97 ]), and Factor Analysis (e.g., [ 98 ]) are introduced to solve the diarization problem.…”
Section: Figure A1mentioning
confidence: 99%
“…Speaker clustering is an effective tool that can alleviate the amount of speech document management tasks [5][6]. Speaker clustering can group similar audio utterance together and attribute it to the same speaker in audio document by some distance measure and clustering scheme in the unsupervised condition [7].…”
Section: Introductionmentioning
confidence: 99%