2009
DOI: 10.1109/tasl.2009.2015698
|View full text |Cite
|
Sign up to set email alerts
|

An Information Theoretic Approach to Speaker Diarization of Meeting Data

Abstract: Abstract-A speaker diarization system based on an information theoretic framework is described. The problem is formulated according to the Information Bottleneck (IB) principle. Unlike other approaches where the distance between speaker segments is arbitrarily introduced, the IB method seeks the partition that maximizes the mutual information between observations and variables relevant for the problem while minimizing the distortion between observations. This solves the problem of choosing the distance between… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
122
0

Year Published

2011
2011
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 94 publications
(122 citation statements)
references
References 20 publications
0
122
0
Order By: Relevance
“…Speaker diarization system used in the current work is based on a non-parametric bottom-up agglomerative framework [19]. The diarization output assigns each speech segment to a unique cluster (speaker) in the output.…”
Section: Baseline Speaker Diarization Systemmentioning
confidence: 99%
See 1 more Smart Citation
“…Speaker diarization system used in the current work is based on a non-parametric bottom-up agglomerative framework [19]. The diarization output assigns each speech segment to a unique cluster (speaker) in the output.…”
Section: Baseline Speaker Diarization Systemmentioning
confidence: 99%
“…Overlap labelling reduces the missed speech error [7,8,11]. Table 1 (first row) shows DER of 30.4 for the speaker diarization system without any overlap handling as described in [19]. Let us now compare the results obtained by the baseline overlap detector and the proposed system which exploits information from speaker changes on three tasks overlap exclusion, labelling and both.…”
Section: Experiments On Overlap Speaker Diarizationmentioning
confidence: 99%
“…This section briefly summarizes the Information Bottleneck speaker diarization system that operates in a normalized space of relevance variables proposed in [9]. The Information Bottleneck is a distributional clustering technique introduced in [10].…”
Section: Information Bottleneck Diarizationmentioning
confidence: 99%
“…In order to apply this method to speaker diarization, the set of relevance variables Y = {yn} is defined as the components of a background GMM (M) trained on the entire audio recording [9]. The input to the clustering algorithm is uniformly segmented speech segments xt.…”
Section: Information Bottleneck Diarizationmentioning
confidence: 99%
“…Clustering algorithms are typically based on the Bayesian information criterion (BIC) distance [12] although recent studies have also presented great improvements using other alternatives based on the t-test distance [13]. Most systems extract spectral features related to the spectral envelope such as the Mel frequency cepstral coefficients (MFCCs) [9], [14], although some studies have presented improvements with the fusion of spectral envelope and pitch features [15]. In [16], an exhaustive analysis of the goodness of prosodic and long-term features in speaker diarization is presented.…”
Section: Introductionmentioning
confidence: 99%