2013 IEEE International Conference on Acoustics, Speech and Signal Processing 2013
DOI: 10.1109/icassp.2013.6637735

Spatial and coherence cues based time-frequency masking for binaural reverberant speech separation

Abstract: Most binaural source separation algorithms consider only the dissimilarities between the recorded mixtures, such as interaural phase and level differences (IPD, ILD), to classify and assign the time-frequency (T-F) regions of the mixture spectrograms to each source. However, in this paper we show that the coherence between the left and right recordings can provide extra information to label the T-F units from the sources. This also reduces the effect of reverberation, which contains random reflections from…

Cited by 16 publications (11 citation statements)
References 15 publications
“…8 and 9. We would like to note that incorporating a precedence model would be expected to improve the performance of binaural method in reverberation as suggested by our preliminary work in [39].…”
Section: E Spatially Diffuse Noise (mentioning)
confidence: 97%
“…It is hard to calculate accurate auto- and cross-PSD (i.e., Φ_ij(n, f)) using Equation (2) with finite-length X_1(n, f) and X_2(n, f). In previous studies [9][10][11], the PSD was estimated by multiplying exponentially decaying weight and summing continuous time-frequency bins over time as…”
Section: Interaural Coherence (mentioning)
confidence: 99%
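
The formula that follows "as…" in the statement above is elided here and is not reconstructed. As a rough illustration of the general idea only, the sketch below uses a common first-order recursive (exponentially weighted) average to estimate the auto- and cross-PSDs and then forms the interaural coherence (IC) from them. The function name `interaural_coherence`, the smoothing factor `alpha`, and its value 0.9 are assumptions for illustration, not taken from the cited or citing papers.

```python
import numpy as np

def interaural_coherence(X1, X2, alpha=0.9, eps=1e-12):
    """Hypothetical helper: IC per T-F unit from two complex STFTs.

    X1, X2: complex arrays of shape (frames, freqs).
    alpha:  assumed smoothing factor of the exponentially decaying weight.
    """
    n_frames, n_freqs = X1.shape
    phi11 = np.zeros(n_freqs)                  # auto-PSD, channel 1
    phi22 = np.zeros(n_freqs)                  # auto-PSD, channel 2
    phi12 = np.zeros(n_freqs, dtype=complex)   # cross-PSD
    ic = np.zeros((n_frames, n_freqs))
    for n in range(n_frames):
        # Recursive averaging: older frames are weighted by powers of alpha.
        phi11 = alpha * phi11 + (1 - alpha) * np.abs(X1[n]) ** 2
        phi22 = alpha * phi22 + (1 - alpha) * np.abs(X2[n]) ** 2
        phi12 = alpha * phi12 + (1 - alpha) * X1[n] * np.conj(X2[n])
        ic[n] = np.abs(phi12) / np.sqrt(phi11 * phi22 + eps)
    return ic

# Synthetic STFT-like data (random, for illustration only).
rng = np.random.default_rng(0)
shape = (200, 64)
S = rng.standard_normal(shape) + 1j * rng.standard_normal(shape)

# "Direct" source: channel 2 is a scaled, phase-shifted copy of channel 1.
ic_direct = interaural_coherence(S, 0.8 * np.exp(-0.3j) * S)

# "Diffuse" field: the two channels are statistically independent.
N1 = rng.standard_normal(shape) + 1j * rng.standard_normal(shape)
N2 = rng.standard_normal(shape) + 1j * rng.standard_normal(shape)
ic_diffuse = interaural_coherence(N1, N2)

print("direct  IC (mean, late frames):", ic_direct[50:].mean())
print("diffuse IC (mean, late frames):", ic_diffuse[50:].mean())
```

With a single frame the estimate is trivially 1, so only later frames are meaningful. Because the averaging window is finite, the diffuse estimate stays above zero even for independent channels; a larger `alpha` lowers that bias at the cost of slower tracking.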
“…If the two microphones are apart far enough, the IC appears close to zero for diffuse sources and close to one for direct sources at most frequencies. Based on these characteristics, the performance is improved by applying IC to the direction of arrival (DoA) estimation of the speaker [9], speech, or source separation [10] and dereverberation [11][12][13] in a reverberation environment.…”
Section: Introduction (mentioning)
confidence: 99%
“…Considering the fact that the prior information of speech and noise can improve speech quality, our former works [26,27] have shown an effectiveness of using binaural inter-channel cues between speech and noise to enhance speech. In previous studies based on the cue parameter [28][29][30][31][32][33][34][35][36][37][38][39], the binaural inter-channel cues [28][29][30][31][32][33][34][35][36][37] have been used to estimate ideal T-F mask in binaural computational auditory scene analysis (CASA) systems and have shown a good performance in binaural speech processing. In the BCC technique [40][41][42], the binaural inter-channel cues were viewed as the side information, which was combined with a down-mixed audio signal to recover the left channel and right channel audio signals.…”
Section: Introduction (mentioning)
confidence: 99%