2004 IEEE International Conference on Acoustics, Speech, and Signal Processing 2004
DOI: 10.1109/icassp.2004.1327164
|View full text |Cite
|
Sign up to set email alerts
|

Low complexity Bayesian single channel source separation

Abstract: We propose a simple Bayesian model for performing single channel speech separation using factorized source priors in a sliding window linearly transformed domain. Using a one dimensional mixture of Gaussians to model each band source leads to fast tractable inference for the source signals. Simulations with separation of a male and female speaker using priors trained on the same speakers show comparable performance with the blind separation approach of Jang and Lee [1] with a SNR improvement of 4.9 dB for both… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
11
0

Year Published

2007
2007
2018
2018

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(11 citation statements)
references
References 2 publications
0
11
0
Order By: Relevance
“…Better separation performance may be achieved with the MAP criterion (12) and appropriate priors compared to what can be obtained with general models. Fig.…”
Section: A Principlementioning
confidence: 99%
See 3 more Smart Citations
“…Better separation performance may be achieved with the MAP criterion (12) and appropriate priors compared to what can be obtained with general models. Fig.…”
Section: A Principlementioning
confidence: 99%
“…To illustrate this issue, let us take an example where one of the sources is a voice signal (as, for instance in [5], [7], [10], and [12]). Either the voice model has been trained on a particular voice but its generalization ability tends to be poor to other voices, or it is trained on a group of voices but then it requires a large number of parameters, and even though, it tends to lack selectivity to a particular voice in a particular mix, not to mention the variability problems that can be caused by different recording and/or transmission conditions.…”
Section: B Problem Statementmentioning
confidence: 99%
See 2 more Smart Citations
“…Such conditions are that there be at least as many input channels (microphones) as sound sources, that background noise be essentially absent, and that there be an instantaneous mixing of the sources (i.e., no time delays). Even sophisticated variants like the noisy or convolutive ICA (Hyvärinen et al, 2001) are bound by rather restrictive conditions, although single-channel ICA methods have been reported to be able to separate a male from a female speaker (Jang & Lee, 2003;Beierholm, Pedersen, & Winther, 2004). ICA's underlying assumption is that the signal is a linear mixture of the contributing sources that are independent realizations of a random process.…”
Section: Introductionmentioning
confidence: 99%