ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020
DOI: 10.1109/icassp40776.2020.9053336
|View full text |Cite
|
Sign up to set email alerts
|

Few-Shot Acoustic Event Detection Via Meta Learning

Abstract: We study few-shot acoustic event detection (AED) in this paper. Few-shot learning enables detection of new events with very limited labeled data. Compared to other research areas like computer vision, few-shot learning for audio recognition has been under-studied. We formulate few-shot AED problem and explore different ways of utilizing traditional supervised methods for this setting as well as a variety of meta-learning approaches, which are conventionally used to solve few-shot classification problem. Compar… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
33
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 48 publications
(33 citation statements)
references
References 11 publications
0
33
0
Order By: Relevance
“…At present, Meta-learning is widely active in image classification [16,[28][29], image recognition [17,30], object detection [18,[31][32], and text classification [19] and other computer vision and natural language processing fields. And it has gradually drawn much attention in speech recognition [20], audio event recognition [21,33], text-to-speech [22], speaker recognition [23] and other areas of speech signal processing. In these scenarios, samples of the new categories may be inherently scarce, or their annotation tags may be difficult to obtain.…”
Section: A Related Work In Meta-learningmentioning
confidence: 99%
“…At present, Meta-learning is widely active in image classification [16,[28][29], image recognition [17,30], object detection [18,[31][32], and text classification [19] and other computer vision and natural language processing fields. And it has gradually drawn much attention in speech recognition [20], audio event recognition [21,33], text-to-speech [22], speaker recognition [23] and other areas of speech signal processing. In these scenarios, samples of the new categories may be inherently scarce, or their annotation tags may be difficult to obtain.…”
Section: A Related Work In Meta-learningmentioning
confidence: 99%
“…C is the total number of target events. Supervised few-shot acoustic event classification [2] aims to train a model using a large set of labeled base class data C train . The goal is to detect novel class data C test with only few labeled samples for each novel event available.…”
Section: Overviewmentioning
confidence: 99%
“…Further, building personalized audio event detectors for smart home assistants such as Alexa, Google Home or Siri requires building models with extremely limited data (e.g., < 10 samples). Few-shot AEC via meta-learning has been proposed as a solution for building models in low data regimes [2].…”
Section: Introductionmentioning
confidence: 99%
“…In addition, for few-shot learning and novelty detection, some studies have utilized acoustic data to generalize an audio class with few number of examples, and detect novel audio classes. In [42], few-shot method based on meta-learning has been presented for acoustic event detection to detect the unknown acoustic classes. Also, only a few works exist using unsupervised algorithms for novelty detection.…”
Section: Related Workmentioning
confidence: 99%