2017 Hands-Free Speech Communications and Microphone Arrays (HSCMA) 2017
DOI: 10.1109/hscma.2017.7895569

Towards acoustically robust localization of speakers in a reverberant environment

Cited by 9 publications (6 citation statements)
References 13 publications
“…These parameters can be estimated using FOA signals. For example, the DOA can be estimated as originally suggested in [5] or using more advanced algorithms [23], [24], [32], and the DRR can be estimated using methods that were recently studied in the ACE challenge [33].…”
Section: Application To Directional Audio Coding
Citation type: mentioning
confidence: 99%
“…Several approaches for estimating the speaker DoA from the selected bins have been proposed, including MUSIC with coherent and incoherent integration of the signal subspaces from the different bins [9], and bin-wise DoA estimation followed by statistical analysis to fuse the estimates [26]-[29].…”
Section: Application To Speaker Localization
Citation type: mentioning
confidence: 99%
“…In the case of a single source, the final DOA estimate can be computed as the mean of Ω_coh. Alternatively, clustering the DOAs in Ω_coh can be applied to eliminate outliers, or, in the case of multiple speakers, to estimate the DOA of each speaker [11], [32]. Because room reflections are coherent with the direct sound, bins that contain direct sound and reflections may still have a rank close to one, potentially degrading the performance of the coherence test, leading to errors under reverberation [11].…”
Section: A. The Coherence Test
Citation type: mentioning
confidence: 99%
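
The single-source case in the statement above (taking the mean of the selected DOAs) can be illustrated with a minimal sketch. It is not taken from the cited papers: the function name `mean_doa`, the (azimuth, elevation) representation in radians, and the choice to average unit vectors rather than raw angles are all assumptions made here so that azimuths near ±π average sensibly.

```python
import math


def mean_doa(doas):
    """Average (azimuth, elevation) pairs, in radians, via their unit vectors.

    Hypothetical helper for illustration; the cited work only says "mean".
    """
    sx = sy = sz = 0.0
    for az, el in doas:
        # Convert each direction to Cartesian coordinates on the unit sphere.
        sx += math.cos(el) * math.cos(az)
        sy += math.cos(el) * math.sin(az)
        sz += math.sin(el)
    norm = math.sqrt(sx * sx + sy * sy + sz * sz)
    if norm == 0.0:
        raise ValueError("mean direction is undefined for this set of DOAs")
    # Convert the summed vector back to angles; clamp guards against rounding.
    azimuth = math.atan2(sy, sx)
    elevation = math.asin(max(-1.0, min(1.0, sz / norm)))
    return azimuth, elevation
```

Averaging vectors instead of angles is a design choice of this sketch: two estimates at azimuths 3.1 and -3.1 rad yield a mean near π, whereas a plain arithmetic mean of the angles would give 0, the opposite direction.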
“…The aim of the analysis presented in this subsection is to provide further insight into the proposed test, by presenting the selected TF bins on top of speech spectrograms, using maps referred to as DPD maps [32]. Once again, the threshold for each test was chosen such that the percentage of TF bins that pass each test will be 10.5%, to support a common basis for comparison.…”
Section: Speech Spectrograms and DPD Maps
Citation type: mentioning
confidence: 99%
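
The statement above fixes the threshold of each test so that the same share of TF bins (10.5%) passes. One way to do this, sketched under assumptions, is to take a quantile of the per-bin test scores. The function name `threshold_for_pass_rate` and the convention that a bin passes when its score is greater than or equal to the threshold are hypothetical; the cited paper does not specify how the threshold was computed.

```python
def threshold_for_pass_rate(scores, pass_fraction):
    """Return a threshold t so that about pass_fraction of scores satisfy score >= t.

    Assumes higher scores are better (an assumption of this sketch).
    """
    if not scores:
        raise ValueError("scores must not be empty")
    if not 0.0 < pass_fraction <= 1.0:
        raise ValueError("pass_fraction must be in (0, 1]")
    ranked = sorted(scores, reverse=True)
    # Number of bins that should pass; at least one bin is always kept.
    n_pass = max(1, round(pass_fraction * len(ranked)))
    # The lowest score among the passing bins becomes the threshold.
    return ranked[n_pass - 1]
```

For example, `threshold_for_pass_rate(scores, 0.105)` would be applied separately to the scores of each test being compared. If several bins share the threshold value, slightly more than the requested share will pass.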