An event-driven probabilistic model of sound source localization using cochlea spikes

Anumula, Jithendar; Ceolini, Enea; He, Zhe; Huber, Alfred; Liu, Shih‐Chii

doi:10.1109/iscas.2018.8351856

Cited by 17 publications

(20 citation statements)

References 17 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The second is M-NICA introduced for blind source separation (BSS) of source envelopes. The third is a recently proposed localization method that uses the timing of asynchronous samples from an event-driven binaural audio sensor [12]. We considered this third method because of its possible low computational complexity.…”

Section: Speaker Activity Detection For Beamforming Calibrationmentioning

confidence: 99%

“…It was recently shown how the outputs of this event-based cochlea sensor could be used to localize multiple active sources simultaneously [12]. Each output event of the sensor is assigned a probability of it being produced by a source at a particular location l ∈ L = {1, ...L} where L is the number of possible locations.…”

Section: Event-based Localization Algorithmmentioning

confidence: 99%

“…The event count feature corresponds to a moving average of events collected in a defined time window. We use this feature as an estimate of the speech envelope [12]. Once an envelope estimate is obtained for each location, the locations of the active sources have to be selected.…”

Section: Event-based Localization Algorithmmentioning

confidence: 99%

“…We extend the work in [12] by using the estimated envelopes in a SAD system. By comparing the various envelopes using (1), we construct a SAD similar to the M-NICA method.…”

Section: Event-based Localization Algorithmmentioning

confidence: 99%

“…This work proposes such a framework with the only assumption of knowing the number of speakers. First, speaker activity over short time frames of 20ms is estimated using different algorithms such as GCC-PHAT [10], multiplicative nonnegative independent components analysis (M-NICA) [11] and spike separation (SPS) [12]. From the results of the SAD, time frames assigned to one speaker are pooled together and the ATFs are then estimated using a noise-covariance whitening based ATF estimation method [6].…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

Speaker Activity Detection and Minimum Variance Beamforming for Source Separation

et al. 2018

Self Cite

View full text Add to dashboard Cite

This work proposes a framework that renders minimum variance beamforming blind allowing for source separation in real world environments with an ad-hoc multi-microphone setup using no assumptions other than knowing the number of speakers. The framework allows for multiple active speakers at the same time and estimates the activity of every single speaker at flexible time resolution. These estimated speaker activities are subsequently used for the calibration of the beamforming algorithm. This framework is tested with three different speaker activity detection (SAD) methods, two of which use classical algorithms and one that is event-driven. Our methods, when tested in real world reverberant scenarios, can achieve very high signal-tointerference ratio (SIR) of around 20 dB and sound quality of 0.85 in short-time objective intelligibility (STOI) close to optimal beamforming results of 22 dB SIR and 0.89 in STOI.

show abstract

Section: Speaker Activity Detection For Beamforming Calibrationmentioning

confidence: 99%

Section: Event-based Localization Algorithmmentioning

confidence: 99%

Section: Event-based Localization Algorithmmentioning

confidence: 99%

“…We extend the work in [12] by using the estimated envelopes in a SAD system. By comparing the various envelopes using (1), we construct a SAD similar to the M-NICA method.…”

Section: Event-based Localization Algorithmmentioning

confidence: 99%