2017 Hands-Free Speech Communications and Microphone Arrays (HSCMA) 2017
DOI: 10.1109/hscma.2017.7895582
|View full text |Cite
|
Sign up to set email alerts
|

Towards real-time source counting by estimation of coherent-to-diffuse ratios from ad-hoc microphone array recordings

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
4
1

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 17 publications
0
4
0
Order By: Relevance
“…Moonen and Bertrand [86] suggested a multi-speaker voice activity detection method that tracks the power of multiple simultaneous speakers. Coherent-to-diffuse ratio (CDR) values (32) calculated or estimated at dual microphone node locations are also applied for source counting [56].…”
Section: Source Counting and Crosstalk Detectionmentioning
confidence: 99%
See 1 more Smart Citation
“…Moonen and Bertrand [86] suggested a multi-speaker voice activity detection method that tracks the power of multiple simultaneous speakers. Coherent-to-diffuse ratio (CDR) values (32) calculated or estimated at dual microphone node locations are also applied for source counting [56].…”
Section: Source Counting and Crosstalk Detectionmentioning
confidence: 99%
“…The main limitation of the method proposed by Pasha et al [56] is that all the nodes must be of the same structure, which limits the method's applicability. The MSC is found using the cross-power spectral density (CPSD) as presented by Pasha et al [87] (Figure 4):…”
Section: Source Counting and Crosstalk Detectionmentioning
confidence: 99%
“…In this research statistical cues, Kurtosis and Skewness [17], derived from the single channel LP residual signals are applied as the distance estimates. Compared with acoustic features such as Magnitude Squared Coherence (MSC) which requires dual-channel microphones of the same structure [18,19], the applied statistical features are applicable to any microphone type.…”
Section: Distance Cues For Ad-hoc Microphonesmentioning
confidence: 99%
“…Also a spatially modified beamformer is proposed and applied for the AR modelling and compression of the source signal through the joint analysis of reverberant multi-channel ad-hoc recordings. The applied statistical distance [16] and reverberation level cues [17] proposed in this work are applicable to microphones with inconsistent gains and do not require identical dual-channel microphones [18,19]. The target application of this research is the speech compression for the immersive meeting scenarios [20] where the microphones do not form a fully connected Wireless Acoustic Sensor Network (WASN) and the participants have their independent recording device and microphone.…”
Section: Introductionmentioning
confidence: 99%