2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2017
DOI: 10.1109/icassp.2017.7952264
|View full text |Cite
|
Sign up to set email alerts
|

Weakly-supervised audio event detection using event-specific Gaussian filters and fully convolutional networks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
38
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
7
3

Relationship

1
9

Authors

Journals

citations
Cited by 49 publications
(38 citation statements)
references
References 14 publications
0
38
0
Order By: Relevance
“…The choice of the pooling function is an important decision. The default choice is the "max" pooling function [12,13], which is faithful to the SMI assumption. A previous study of ours [14] has evaluated a "noisy-or" pooling function [15,16,17], and has shown it does not work for localization despite its nice probabilistic interpretation under the SMI assumption.…”
Section: Introductionmentioning
confidence: 99%
“…The choice of the pooling function is an important decision. The default choice is the "max" pooling function [12,13], which is faithful to the SMI assumption. A previous study of ours [14] has evaluated a "noisy-or" pooling function [15,16,17], and has shown it does not work for localization despite its nice probabilistic interpretation under the SMI assumption.…”
Section: Introductionmentioning
confidence: 99%
“…In [30], a mel spectrogram of an audio clip is presented to a CNN, where the filters of each convolutional layer capture local patterns of a spectrogram. After a global pooling layer such as global max pooling [28], global average pooling [31], global weighted rank pooling [23], global attention pooling [32,33] or other poolings [34,35], fully connected layers are applied to predict the presence probabilities of audio classes. Fig.…”
Section: B Convolutional Neural Network For Audio Tagging and Weaklmentioning
confidence: 99%
“…Understanding the surrounding environment through sounds has been considered as a major component in many daily applications, such as surveillance, smart city, and smart cars [1,2]. Recent years have witnessed great progress in sound event detection and classification using deep learning techniques [3][4][5][6][7]. However, most prior arts rely on standard supervised learning algorithm and may not perform well for sound events with sparse training examples.…”
Section: Introductionmentioning
confidence: 99%