DOI: 10.1007/978-3-540-72849-8_7
|View full text |Cite
|
Sign up to set email alerts
|

A Simple But Effective Approach to Speaker Tracking in Broadcast News

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Publication Types

Select...
3
1
1

Relationship

1
4

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 8 publications
0
4
0
Order By: Relevance
“…The task and evaluation measures were cast in a detection and retrieval framework, respectively, and can be compared to a known item retrieval task in text retrieval. Most speaker tracking systems solve the task by performing speaker diarization followed by speaker detection [9], [10]. Although this problem is similar to our approach for large scale diarization, there are two important differences.…”
Section: Related Workmentioning
confidence: 99%
“…The task and evaluation measures were cast in a detection and retrieval framework, respectively, and can be compared to a known item retrieval task in text retrieval. Most speaker tracking systems solve the task by performing speaker diarization followed by speaker detection [9], [10]. Although this problem is similar to our approach for large scale diarization, there are two important differences.…”
Section: Related Workmentioning
confidence: 99%
“…In speaker tracking, the task is to find spoken segments of a particular speaker for which some training material is given. Most speaker tracking systems solve the task by performing speaker segmentation followed by speaker detection [5,6]. Because only a selection of a-priorly known people are tracked, labeling clusters with corresponding names is straightforward.…”
Section: Related Workmentioning
confidence: 99%
“…This involves applying a threshold τ and forcing a minimum segment size δ. In practice, a boundary t is validated when its cross-likelihood ratio exceeds τ and there is no candidate boundary with greater ratio in the interval [t-δ,t+δ] (see [13] for details).…”
Section: Audio Segmentationmentioning
confidence: 99%