2011
DOI: 10.1109/tasl.2010.2052244
|View full text |Cite
|
Sign up to set email alerts
|

A Region-Growing Permutation Alignment Approach in Frequency-Domain Blind Source Separation of Speech Mixtures

Abstract: Abstract-The convolutive blind source separation (BSS) problem can be solved efficiently in the frequency domain, where instantaneous BSS is performed separately in each frequency bin. However, the permutation ambiguity in each frequency bin should be resolved so that the separated frequency components from the same source are grouped together. To solve the permutation problem, this paper presents a new alignment method based on an inter-frequency dependence measure: the powers of separated signals. Bin-wise p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
29
0

Year Published

2013
2013
2016
2016

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 51 publications
(29 citation statements)
references
References 24 publications
0
29
0
Order By: Relevance
“…Nesta et al pro-posed an extension for underdetermined scenario where multiple complex valued ICA adaptations jointly estimate the mixing matrix and the temporal activities of multiple sources in each frequency band to exploit the spectral sparsity of speech signals (Nesta and Omologo, 2012). The method does not explicitly rely on identification of the acoustic channel and recovery of the desired source imposes a permutation problem due to mis-alignment of the individual source components (Nesta and Omologo, 2012;Wang et al, 2011). Other extensions of ICA for the underdetermined scenarios consist in integration with sparse masking techniques within a hierarchical separation framework (Araki et al, 2004;Davies and Mitianoudis, 2004).…”
Section: Prior Workmentioning
confidence: 99%
“…Nesta et al pro-posed an extension for underdetermined scenario where multiple complex valued ICA adaptations jointly estimate the mixing matrix and the temporal activities of multiple sources in each frequency band to exploit the spectral sparsity of speech signals (Nesta and Omologo, 2012). The method does not explicitly rely on identification of the acoustic channel and recovery of the desired source imposes a permutation problem due to mis-alignment of the individual source components (Nesta and Omologo, 2012;Wang et al, 2011). Other extensions of ICA for the underdetermined scenarios consist in integration with sparse masking techniques within a hierarchical separation framework (Araki et al, 2004;Davies and Mitianoudis, 2004).…”
Section: Prior Workmentioning
confidence: 99%
“…Since ICA in different layers may output the separated results in different order, the permutation ambiguity will occur in FDBSS, which is indicated by the different color of y [f] in Figure 1a. The permutation ambiguity must be carefully addressed by algorithms like [7][8][9][10][11][12] before the inverse STFT is performed, or else the separation procedure will fail.…”
Section: From Fdbss To Ivamentioning
confidence: 99%
“…in neighboring frequency bins, and this feature is often used to solve the permutation problem [8][9][10]. When the subband technique proposed in the previous section is used, stronger correlations are expected to be observed because of the local dependency property of the data.…”
Section: Algorithm 1 the Subband Iva Algorithmmentioning
confidence: 99%
See 2 more Smart Citations