ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019
DOI: 10.1109/icassp.2019.8683198
|View full text |Cite
|
Sign up to set email alerts
|

Bootstrapping Single-channel Source Separation via Unsupervised Spatial Clustering on Stereo Mixtures

Abstract: Separating an audio scene into isolated sources is a fundamental problem in computer audition, analogous to image segmentation in visual scene analysis. Source separation systems based on deep learning are currently the most successful approaches for solving the underdetermined separation problem, where there are more sources than channels. Traditionally, such systems are trained on sound mixtures where the ground truth decomposition is already known. Since most real-world recordings do not have such a decompo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
22
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
3
2
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 28 publications
(22 citation statements)
references
References 29 publications
0
22
0
Order By: Relevance
“…Ensemble methods have seen rapid growth in the past decade within the machine learning community [39]. An ensemble is a group of predictors, each of which gives an estimate of a response variable.…”
Section: Background Materialsmentioning
confidence: 99%
“…Ensemble methods have seen rapid growth in the past decade within the machine learning community [39]. An ensemble is a group of predictors, each of which gives an estimate of a response variable.…”
Section: Background Materialsmentioning
confidence: 99%
“…Many techniques reported consisted of multiple stages separately optimized under different criteria, such as signal representation and embedding [15]. Some embedding clustering methods add phase information on multi-channel, and other research concerns the low-delay of deep clustering approaches [16], [17]. Orthogonal deep clustering improves the separation performance of the model by adding an orthogonal constraint penalty term of the objective function to reduce the correlation between the embedded expression [18].…”
Section: Related Work a Speech Separation Based On Deep Clusteringmentioning
confidence: 99%
“…They reported that the cACGMM performance was improved by initializing it with the pre-trained separation network. Seetharaman et al [13] designed a loss function weighted by a confidence measure of the estimated references. Drude et al [15] also proposed a novel approach that directly trains a separation network from the cACGMM likelihood.…”
Section: Unsupervised Training Of Neural Source Separationmentioning
confidence: 99%
“…Unsupervised training for neural source separation using multichannel mixture signals has recently gained a lot of attention [12][13][14][15]. One approach is to generate supervised data by using multichannel separation methods [12][13][14]. This approach suffers from the estimation errors of the multichannel methods mentioned above.…”
Section: Introductionmentioning
confidence: 99%