6th European Conference on Speech Communication and Technology (Eurospeech 1999) 1999
DOI: 10.21437/eurospeech.1999-279
|View full text |Cite
|
Sign up to set email alerts
|

Detection of speaker changes in an audio document

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
0
0

Year Published

2000
2000
2013
2013

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 15 publications
(7 citation statements)
references
References 6 publications
0
0
0
Order By: Relevance
“…Usually GLR is applied at the first stage of a two-step implementation, over-segmenting the data [Gangadharaiah and et al, 2004], [Delacourt and Wellekens, 2000]. The most representative algorithm of the GLR applied to the speaker segmentation task is DISTBIC [Delacourt and Kryze, 1999], [Delacourt and Wellekens, 2000]. DISTBIC makes use of a two-step segmentation by firstly applying GLR followed by a BIC as refinement boundaries step.…”
Section: Metric-based Segmentationmentioning
confidence: 99%
“…Usually GLR is applied at the first stage of a two-step implementation, over-segmenting the data [Gangadharaiah and et al, 2004], [Delacourt and Wellekens, 2000]. The most representative algorithm of the GLR applied to the speaker segmentation task is DISTBIC [Delacourt and Kryze, 1999], [Delacourt and Wellekens, 2000]. DISTBIC makes use of a two-step segmentation by firstly applying GLR followed by a BIC as refinement boundaries step.…”
Section: Metric-based Segmentationmentioning
confidence: 99%
“…Such threshold needs to be tuned to the data and therefore its correct setting has been subject of constant study. Several people propose ways to automatically selecting λ, (Tritschler and Gopinath (1999), Delacourt and Wellekens (2000), Delacourt, Kryze and Wellekens (1999a), Mori and Nakagawa (2001), Lopez and Ellis (2000a), Vandecatseye, Martens et al (2004)). In Ajmera, McCowan and Bourlard (2003) a GMM is used for each of the models (M , M i and M j ) and by building the model M with the sum of models M i and M j complexities, it cancels out the penalty term avoiding the need to set any λ value.…”
Section: Metric-based Segmentationmentioning
confidence: 99%
“…This is why some people have proposed BIC as the second pass (refinement) of a 2-pass speaker segmentation system. As described earlier, an important step in this direction is taken with DISTBIC (Delacourt and Wellekens (2000), Delacourt et al (1999a), Delacourt, Kryze and Wellekens (1999b)) where the GLR is used as a first pass. Also in this direction are Zhou and Hansen (2000), Kim, Ertelt and Sikora (2005) and Tranter and Reynolds (2004), proposing the to use Hotelling's T 2 distance, and Lu and Zhang (2002a) using KL2 (Kullback-Leibler) distance.…”
Section: Metric-based Segmentationmentioning
confidence: 99%
See 1 more Smart Citation
“…The two-model hypothesis is favored if ∆BIC is positive, the ML changing point can be expressed as t = arg max i ∆BIC(i). Several works addressed the fine tuning of the penalty weight parameter [57,58,49], or it was discarded totally [59].…”
Section: Bayesian Information Criterionmentioning
confidence: 99%