2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2012
DOI: 10.1109/icassp.2012.6288839

On the effect of SNR and superdirective beamforming in speaker diarisation in meetings

Abstract: This paper examines the effect of sensor performance on speaker diarisation in meetings and investigates the use of more advanced beamforming techniques, beyond the typically employed delay-sum beamformer, for mitigating the effects of poorer sensor performance. We present superdirective beamforming and investigate how different time difference of arrival (TDOA) smoothing and beamforming techniques influence the performance of state-of-the-art diarisation systems. We produced and transcribed a new corpus of me…
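
For readers unfamiliar with the delay-sum beamformer that the abstract uses as its baseline, the following is a minimal sketch of the general idea, not the paper's implementation. It assumes the TDOAs are already known and are whole numbers of samples; the function name `delay_and_sum` and the toy signals are hypothetical, and neither TDOA smoothing nor superdirective beamforming is shown.

```python
def delay_and_sum(channels, delays):
    """Align each channel by its integer sample delay, then average.

    channels: list of equal-length lists of samples, one per microphone.
    delays:   per-channel TDOA in whole samples relative to a reference
              microphone (positive means the sound arrives later there).
    Both the name and the integer-delay simplification are assumptions
    made for illustration only.
    """
    n = len(channels[0])
    out = [0.0] * n
    for samples, d in zip(channels, delays):
        for t in range(n):
            src = t + d  # read ahead to undo the arrival delay
            if 0 <= src < n:
                out[t] += samples[src]
    return [v / len(channels) for v in out]


# Toy example: one pulse reaching three microphones 0, 1 and 2 samples late.
pulse = [0.0, 0.0, 1.0, 0.0, 0.0, 0.0]
mics = [pulse, [0.0] + pulse[:-1], [0.0, 0.0] + pulse[:-2]]

aligned = delay_and_sum(mics, [0, 1, 2])    # delays compensated
unaligned = delay_and_sum(mics, [0, 0, 0])  # delays ignored
```

With the delays compensated, the three copies of the pulse add coherently and the peak keeps its full amplitude; with the delays ignored, the same energy is smeared over three samples. Uncorrelated sensor noise does not add coherently in either case, which is why the averaging step improves SNR only when the alignment is right.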

Cited by 4 publications (2 citation statements)
References 8 publications
“…Recent works on digital microphone arrays discuss their implementation and compare their performance to analog arrays for various signal processing tasks, such as distant speech recognition [9], recognition of overlapping speech [10], speech enhancement [11], and speaker diarisation [12]. The work in [13] presents a digital array design for aeroacoustic measurements, while the implementation of a system for sound acquisition with MEMS microphones is presented in [14].…”
Section: Introduction (mentioning)
confidence: 99%
“…In recent years, meeting speech recognition (Maganti et al, 2007;Nasu et al, 2011) and meeting speaker diarization (Boakye et al, 2008;Ben-Harush et al, 2009;Stolcke et al, 2010;Sun et al, 2010;Valente et al, 2010;Vijayasenan et al, 2010;Boakye et al, 2011;Stolcke, 2011;Valente et al, 2011;Yella et al, 2011;Vijayasenan et al, 2012;Zwyssig et al, 2012) have been effectively utilized to transcribe and browse meeting procedures. However, their performance is usually low at the overlapped speech segments where more than one speaker is speaking.…”
Section: Introduction (mentioning)
confidence: 99%