2006 International Symposium on Intelligent Signal Processing and Communications 2006
DOI: 10.1109/ispacs.2006.364902
|View full text |Cite
|
Sign up to set email alerts
|

Blind Speaker Clustering

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2006
2006
2016
2016

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(8 citation statements)
references
References 10 publications
0
8
0
Order By: Relevance
“…In telephone conversations, homogeneous speaker utterances are short because speakers change relatively frequently (Iyer et al 2006;Ofoegbu et al 2006a); moreover, information about speaker change points is not usually available. Consequently, increasing the length of data segments used in comparison does not necessarily result in an increase in speaker differentiation performance, as data from more than one speaker could be combined in the same segment.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…In telephone conversations, homogeneous speaker utterances are short because speakers change relatively frequently (Iyer et al 2006;Ofoegbu et al 2006a); moreover, information about speaker change points is not usually available. Consequently, increasing the length of data segments used in comparison does not necessarily result in an increase in speaker differentiation performance, as data from more than one speaker could be combined in the same segment.…”
Section: Discussionmentioning
confidence: 99%
“…In this study, the speaker count is assumed to be known as the goal here is to simply compare the clustering performance. Moreover, separate investigations have already been performed to determine the speaker in a given conversation (Ofoegbu et al 2006b(Ofoegbu et al , 2006cIyer et al 2006). Note that the task of speaker clustering is different from SID, due to the fact that telephone conversations are analyzed, where the presence of long speaker homogeneous utterances is limited.…”
Section: Speaker Clusteringmentioning
confidence: 99%
“…In contrast, SensePresence's occupancy counting process is entirely unsupervised. The authors of [11] used unsupervised techniques to perform speaker clustering using distances of the feature vectors extracted from different speakers. However this occupant estimation has been done only on telephonic conversational data where our proposed system, SensePresence performs speaker counting without any staged conversational setup.…”
Section: Related Workmentioning
confidence: 99%
“…Owing to the lack of knowledge regarding the real number of speakers, the discussed problem can be interpreted as an unsupervised learning problem. Classic solutions to the problem formulated in this way are based on the concept of hierarchical clustering, where different numbers of groups are checked and various distance measures are used: Mahalanobis distance (Iyer et al, 2006), Bhattacharyya distance (Basseville, 1989), Hellinger distance (Lu et al, 2003), and Generalized Likelihood Ratio (GLR) (Anderson, 2003). Further development of the discussed HC methods has given rise to the clustering algorithm applying Leader Following Concept (LFC) to k-means algorithm (Duda et al, 2001 …”
Section: Introductionmentioning
confidence: 99%