2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2019
DOI: 10.1109/waspaa.2019.8937203
|View full text |Cite
|
Sign up to set email alerts
|

A Style Transfer Approach to Source Separation

Abstract: Training neural networks for source separation involves presenting a mixture recording at the input of the network and updating network parameters in order to produce an output that resembles the clean source. Consequently, supervised source separation depends on the availability of paired mixture-clean training examples. In this paper, we interpret source separation as a style transfer problem. We present a variational auto-encoder network that exploits the commonality across the domain of mixtures and the do… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
3

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 25 publications
0
2
0
Order By: Relevance
“…In music, the most useful application is that of separating the lead vocals from a musical mixture. This problem is well researched and numerous deep learning based models have recently been proposed to tackle it [4,5,6,7,8,9,10,11]. Most of these models use the neural network to predict soft time frequency masks, given an input magnitude spectrogram of the mixture signal.…”
Section: Introductionmentioning
confidence: 99%
“…In music, the most useful application is that of separating the lead vocals from a musical mixture. This problem is well researched and numerous deep learning based models have recently been proposed to tackle it [4,5,6,7,8,9,10,11]. Most of these models use the neural network to predict soft time frequency masks, given an input magnitude spectrogram of the mixture signal.…”
Section: Introductionmentioning
confidence: 99%
“…To relax the constraints of paired training data, a few recent approaches interpret the problem of denoising and source separation as a style-transfer problem wherein, the goal is to map from the domain of noisy mixtures to the domain of clean sounds (Stoller et al, 2018;Michelashvili et al, 2019;Venkataramani et al, 2019). These approaches only require a training set of mixtures and a training set of clean sounds, but the clean sounds can be unpaired and unrelated to the mixtures.…”
Section: Introductionmentioning
confidence: 99%