Interspeech 2020 2020
DOI: 10.21437/interspeech.2020-0034
|View full text |Cite
|
Sign up to set email alerts
|

Separating Varying Numbers of Sources with Auxiliary Autoencoding Loss

Abstract: Many recent source separation systems are designed to separate a fixed number of sources out of a mixture. In the cases where the source activation patterns are unknown, such systems have to either adjust the number of outputs or to identify invalid outputs from the valid ones. Iterative separation methods have gain much attention in the community as they can flexibly decide the number of outputs, however (1) they typically rely on long-term information to determine the stopping time for the iterations, which … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 17 publications
(15 citation statements)
references
References 25 publications
0
15
0
Order By: Relevance
“…One drawback of this approach compared to ours is that the sourcerecurrent network needs to be run N times to separate N sources, while our proposed network only needs to run once. More recently, Luo and Mesgarani [9] proposed a separation model trained to predict the mixture for inactive output sources. In contrast, our approach trains the separation model to predict zero for inactive sources, making it easier to determine when sources are inactive.…”
Section: Relation To Prior Workmentioning
confidence: 99%
“…One drawback of this approach compared to ours is that the sourcerecurrent network needs to be run N times to separate N sources, while our proposed network only needs to run once. More recently, Luo and Mesgarani [9] proposed a separation model trained to predict the mixture for inactive output sources. In contrast, our approach trains the separation model to predict zero for inactive sources, making it easier to determine when sources are inactive.…”
Section: Relation To Prior Workmentioning
confidence: 99%
“…This could be addressed within our spatially distributed framework by adapting a dedicated strategy, e.g. in estimating the sources iteratively [21] or by adapting the loss function [22,23].…”
Section: Performance In Under-determined Casesmentioning
confidence: 99%
“…A2PIT is used in the source separation task with a variable number of sources [18]. With the scheme, the mixture signal itself is used as the auxiliary targets instead of low-energy random noise signals when there are fewer targets than tracks [18]. That enables the model to use any objective functions such as scale-invariant signalto-distortion ratio (SI-SDR).…”
Section: Related Workmentioning
confidence: 99%
“…The idea of replacing auxiliary targets inspires us to use another auxiliary target instead of a zero vector in the SELD task. A similarity threshold between the mixture and the outputs was used during inference to determine valid outputs [18].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation