2022
DOI: 10.48550/arxiv.2204.03895
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

SoundBeam: Target sound extraction conditioned on sound-class labels and enrollment clues for increased performance and continuous learning

Abstract: In many situations, we would like to hear desired sound events (SEs) while being able to ignore interference. Target sound extraction (TSE) aims at tackling this problem by estimating the sound of target SE classes in a mixture while suppressing all other sounds. We can achieve this with a neural network that extracts the target SEs by conditioning it on clues representing the target SE classes. Two types of clues have been proposed, i.e., target SE class labels and enrollment sound samples similar to the targ… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 48 publications
(129 reference statements)
0
1
0
Order By: Relevance
“…Consequently, ConceptBeam performs worse under these conditions. In future work, we will investigate approaches to mitigate this issue [4,7,22,49].…”
Section: Resultsmentioning
confidence: 99%
“…Consequently, ConceptBeam performs worse under these conditions. In future work, we will investigate approaches to mitigate this issue [4,7,22,49].…”
Section: Resultsmentioning
confidence: 99%