2016
DOI: 10.48550/arxiv.1604.07160
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Deep Convolutional Neural Networks and Data Augmentation for Acoustic Event Detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
24
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 31 publications
(24 citation statements)
references
References 28 publications
0
24
0
Order By: Relevance
“…The early neural network-based method directly detects the sound events via fully connected layers [11]. With the development of deep learning in computer vision, the convolutional neural network architecture is adopted to obtain the suitable temporal frequency representations of audio [12], [13], [14], [15], [16]. Due to the sequential nature of audio, recurrent neural networks are also used to learn the long-time feature representation of audio [17].…”
Section: A Sound Event Detection In Audiomentioning
confidence: 99%
“…The early neural network-based method directly detects the sound events via fully connected layers [11]. With the development of deep learning in computer vision, the convolutional neural network architecture is adopted to obtain the suitable temporal frequency representations of audio [12], [13], [14], [15], [16]. Due to the sequential nature of audio, recurrent neural networks are also used to learn the long-time feature representation of audio [17].…”
Section: A Sound Event Detection In Audiomentioning
confidence: 99%
“…This framework maps audio bases, extracted by non-negative matrix factorization (NMF), to the detected visual objects. In recent year, audio event detection (AED) [8,29,36] has received attention in the research community. Most of the AED methods locate audio events and then classify each event.…”
Section: Related Workmentioning
confidence: 99%
“…The most widely used deep learning technique for audio classification is to apply a 2D convolutional network on the spectrogram or other derived features [12]- [14]. However, to the best of our knowledge, no other deep learning scheme has specifically addressed classification/segmentation into all of the three classes -Music, Speech and noise under a single unified framework.…”
Section: A 2d Convolutional Neural Network For Distillation and Compa...mentioning
confidence: 99%