2010 IEEE International Conference on Acoustics, Speech and Signal Processing 2010
DOI: 10.1109/icassp.2010.5495089
|View full text |Cite
|
Sign up to set email alerts
|

Low-latency online speaker tracking on the AMI Corpus of meeting conversations

Abstract: Ambient Inteligence aims to create smart spaces providing services in a transparent and non-intrusive fashion, so context awareness and user adaptation are key issues. Speech can be exploited for user adaptation in such scenarios by continuously tracking speaker identity. However, most speaker tracking approaches require processing the full audio recording before determining speaker turns, which makes them unsuitable for online processing and low-latency decision-making. In this work a low-latency speaker trac… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2011
2011
2018
2018

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(5 citation statements)
references
References 9 publications
0
5
0
Order By: Relevance
“…This duration seems to be rational because speech utterances shorter that one second are rare. The same duration settings are applied in [11] with good achievements.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…This duration seems to be rational because speech utterances shorter that one second are rare. The same duration settings are applied in [11] with good achievements.…”
Section: Methodsmentioning
confidence: 99%
“…This has reduced the threshold dependency on the number of speakers. In [11], a speaker tracking system is presented which outputs decisions at one second intervals by scoring the segments on generic speaker models. The length of segments is empirically set to provide good time resolution and small latency for tracking.…”
Section: Introductionmentioning
confidence: 99%
“…number of speakers or speaker models). A number of online diarization [11,12,13,14,15] and speaker tracking [16,17] solutions have been reported. These use online speaker clustering algorithms [18,19].…”
Section: Prior Workmentioning
confidence: 99%
“…In Reynolds and Torres-Carrasquillo [110], MFCC features for each cluster is processed with CMN to increase robustness against channel distortion in their offline speaker diarization systems. While in Zamalloa et al [158], the dynamic CMN approach is applied in their online speaker tracking system.…”
Section: Cepstral Mean Normalizationmentioning
confidence: 99%