Interspeech 2021 2021
DOI: 10.21437/interspeech.2021-1369
|View full text |Cite
|
Sign up to set email alerts
|

Few-Shot Learning of New Sound Classes for Target Sound Extraction

Abstract: Target sound extraction consists of extracting the sound of a target acoustic event (AE) class from a mixture of AE sounds. It can be realized using a neural network that extracts the target sound conditioned on a 1-hot vector that represents the desired AE class. With this approach, embedding vectors associated with the AE classes are directly optimized for the extraction of sound classes seen during training. However, it is not easy to extend this framework to new AE classes, i.e. unseen during training. Rec… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 9 publications
(3 citation statements)
references
References 21 publications
(50 reference statements)
0
3
0
Order By: Relevance
“…With the second approach, the target SE embeddings are derived from an enrollment sample or audio query, which is a short recording of an isolated sound similar to the target SE [12], [13], [17]- [19]. Enrollment-based TSE systems learn to extract sounds that share similar characteristics to the enrollment without explicitly relying on SE class labels.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…With the second approach, the target SE embeddings are derived from an enrollment sample or audio query, which is a short recording of an isolated sound similar to the target SE [12], [13], [17]- [19]. Enrollment-based TSE systems learn to extract sounds that share similar characteristics to the enrollment without explicitly relying on SE class labels.…”
Section: Introductionmentioning
confidence: 99%
“…The initial ideas of SoundBeam were presented in our previous work [13]. In this paper, we provide a more detailed explanation of the approach and an extensive evaluation of the TSE frameworks, covering the following four practical aspects.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation