2006 International Symposium on Intelligent Signal Processing and Communications 2006
DOI: 10.1109/ispacs.2006.364899
|View full text |Cite
|
Sign up to set email alerts
|

A Speaker Count System for Telephone Conversations

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
6
0

Year Published

2006
2006
2020
2020

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 11 publications
(6 citation statements)
references
References 4 publications
0
6
0
Order By: Relevance
“…Our proposed model anonymously estimates the number of people from the smartphones' acoustic cum locomotive sensing model where we have employed unsupervised learning techniques to cluster different forms of acoustic signatures. For example, Ofoegbu et al [22] have built a model from mean and covariance matrices of the linear predictive cepstral coefficient (LPCC) of voice segments in conversations and used the Mahalanobis distance to determine whether two models belong to the same or different speakers. Iyer et al [23] have performed speaker clustering using distance of the feature vectors extracted from different speakers and finally applied the modified k-means algorithm with distance metric data.…”
Section: Speaker Sensingmentioning
confidence: 99%
“…Our proposed model anonymously estimates the number of people from the smartphones' acoustic cum locomotive sensing model where we have employed unsupervised learning techniques to cluster different forms of acoustic signatures. For example, Ofoegbu et al [22] have built a model from mean and covariance matrices of the linear predictive cepstral coefficient (LPCC) of voice segments in conversations and used the Mahalanobis distance to determine whether two models belong to the same or different speakers. Iyer et al [23] have performed speaker clustering using distance of the feature vectors extracted from different speakers and finally applied the modified k-means algorithm with distance metric data.…”
Section: Speaker Sensingmentioning
confidence: 99%
“…Precisely controlling these parameters at the same time in real world experiments is often unfeasible. For this reason, we follow a common approach in the speech community and generate a separate dataset, as previously shown in [26]. Specifically, we collect audio recordings from 4 male and 4 female participants using a smartphone.…”
Section: Performance With Various Conversation Parametersmentioning
confidence: 99%
“…The closest related research to Crowd++ is [1] and [26]. Agneessens et al [1] present a pitch estimation algorithm to recognize a single speaker from audio recordings containing two speakers with 70% of the times correctly estimate the speaker count (referred to as counting accuracy).…”
Section: Speaker Countingmentioning
confidence: 99%
See 1 more Smart Citation
“…In this study, the speaker count is assumed to be known as the goal here is to simply compare the clustering performance. Moreover, separate investigations have already been performed to determine the speaker in a given conversation (Ofoegbu et al 2006b(Ofoegbu et al , 2006cIyer et al 2006). Note that the task of speaker clustering is different from SID, due to the fact that telephone conversations are analyzed, where the presence of long speaker homogeneous utterances is limited.…”
Section: Speaker Clusteringmentioning
confidence: 99%