Grouping Separated Frequency Components by Estimating Propagation Model Parameters in Frequency-Domain Blind Source Separation

Sawada, Hiroshi; Araki, Shoko; Mukai, Ryo; Makino, Shoji

doi:10.1109/tasl.2007.899218

Cited by 109 publications

(98 citation statements)

References 32 publications


“…Either A or B was the same signal as X, and the other was the comparison signal (Eq. (20)). However, subjects did not know which signals were reference or comparison.…”

Section: Subjective Results (mentioning)

confidence: 99%

“…The phase component of that vector contains geometrical information of the virtual sources. In [11], [16]-[20], this geometrical information was used to solve the permutation problem of FD-ICA.…”

Section: Grouping Virtual Signal Components (mentioning)

confidence: 99%

“…Therefore, we define operation φ(·) on the transfer function vectors to extract the relative phase at each microphone [20]:…”

Section: Grouping Virtual Signal Components (mentioning)

confidence: 99%


Selective Listening Point Audio Based on Blind Signal Separation and Stereophonic Technology

Niwa, Nishino, Takeda. 2009. IEICE Trans. Inf. & Syst.



“…This problem makes it difficult to classify the PDOA because the phase has the indeterminacy of modulus 2πk in high frequencies. [4] considered the spatial aliasing problem in a time-frequency mask approach, however, the number of sources N_s should be known.…”

Section: Introduction (mentioning)

confidence: 99%

Stereo Source Separation and Source Counting with MAP Estimation with Dirichlet Prior Considering Spatial Aliasing Problem

Araki, Nakatani, Sawada, et al. 2009. Independent Component Analysis and Signal Separation

Self Cite


Abstract. In this paper, we propose a novel sparse source separation method that can estimate the number of sources and time-frequency masks simultaneously, even when the spatial aliasing problem exists. Recently, many sparse source separation approaches with time-frequency masks have been proposed. However, most of these approaches require information on the number of sources in advance. In our proposed method, we model the phase difference of arrival (PDOA) between microphones with a Gaussian mixture model (GMM) with a Dirichlet prior. Then we estimate the model parameters by using the maximum a posteriori (MAP) estimation based on the EM algorithm. In order to avoid one cluster being modeled by two or more Gaussians, we utilize a sparse distribution modeled by the Dirichlet distributions as the prior of the GMM mixture weight. Moreover, to handle wide microphone spacing cases where the spatial aliasing problem occurs, the indeterminacy of modulus 2πk in the phase is also included in our model. Experimental results show good performance of our proposed method.
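The "indeterminacy of modulus 2πk in the phase" mentioned in this abstract can be illustrated with a small numerical sketch. It is not the authors' GMM/EM method; it only shows why a wrapped phase difference of arrival (PDOA) maps to several candidate delays once the microphone spacing is wide. The two-microphone far-field setup, the 340 m/s speed of sound, and all function names are assumptions made for illustration.

```python
import math

SPEED_OF_SOUND = 340.0  # m/s; assumed value for illustration


def aliasing_frequency(mic_spacing_m, c=SPEED_OF_SOUND):
    """Frequency above which the PDOA can exceed +/-pi for some direction."""
    return c / (2.0 * mic_spacing_m)


def wrapped_pdoa(freq_hz, delay_s):
    """Phase difference for a given inter-microphone delay, wrapped to [-pi, pi)."""
    true_phase = 2.0 * math.pi * freq_hz * delay_s
    return (true_phase + math.pi) % (2.0 * math.pi) - math.pi


def candidate_delays(freq_hz, observed_phase, max_delay_s):
    """All delays within +/-max_delay_s that explain the observed wrapped phase.

    Each integer k in (observed_phase + 2*pi*k) gives one candidate.
    """
    k_max = math.ceil(freq_hz * max_delay_s) + 1
    delays = []
    for k in range(-k_max, k_max + 1):
        delay = (observed_phase + 2.0 * math.pi * k) / (2.0 * math.pi * freq_hz)
        if abs(delay) <= max_delay_s:
            delays.append(delay)
    return delays


if __name__ == "__main__":
    spacing = 0.20                        # 20 cm spacing (hypothetical)
    max_delay = spacing / SPEED_OF_SOUND  # largest physically possible delay
    freq = 6000.0                         # well above the aliasing frequency
    phase = wrapped_pdoa(freq, 1e-4)      # true delay of 0.1 ms
    print("aliasing frequency [Hz]:", aliasing_frequency(spacing))
    print("wrapped PDOA [rad]:", phase)
    print("candidate delays [s]:", candidate_delays(freq, phase, max_delay))
```

Below the aliasing frequency only k = 0 yields a physically possible delay, so the PDOA identifies the delay uniquely. Above it, several values of k survive the bound, which is the ambiguity a model has to account for when the microphones are widely spaced.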


“…Beamforming attempts to improve SNR of a source using directional information [3,8]. Other approaches perform a time-frequency decomposition of the mixture signals and use between channel level and time delay differences in each time-frequency (T-F) unit to estimate an output signal that originates from a particular direction [8,12,14,18]. These systems use localization information as a primary cue to achieve source segregation, and show rapid performance degradation as reverberation is added to the recordings.…”

Section: Introduction (mentioning)

confidence: 99%

On the role of localization cues in binaural segregation of reverberant speech

Woodruff, Wang. 2009. 2009 IEEE International Conference on Acoustics, Speech and Signal Processing


Approaches to binaural and stereo speech segregation have often assumed that localization information can be used as a primary cue to achieve segregation of a target signal. Results produced by these systems degrade significantly in the presence of room reverberation. In this work, we present an alternative framework to achieve localization of groups of time-frequency units. We show that grouping across time and frequency allows the use of localization as an important cue for sequential grouping of time-frequency objects. We analyze the level of time-frequency grouping needed to achieve accurate object localization and show preliminary binaural segregation results using the proposed framework. Results indicate that both localization and segregation performance can be improved by grouping across time and frequency.

Index Terms: Binaural sound localization, speech segregation, reverberation, computational auditory scene analysis.

