Interspeech 2020 2020
DOI: 10.21437/interspeech.2020-1692
|View full text |Cite
|
Sign up to set email alerts
|

Surgical Mask Detection with Convolutional Neural Networks and Data Augmentations on Spectrograms

Abstract: In many fields of research, labeled data-sets are hard to acquire. This is where data augmentation promises to overcome the lack of training data in the context of neural network engineering and classification tasks. The idea here is to reduce model over-fitting to the feature distribution of a small under-descriptive training data-set. We try to evaluate such data augmentation techniques to gather insights in the performance boost they provide for several convolutional neural networks on mel-spectrogram repre… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
9
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 7 publications
(9 citation statements)
references
References 13 publications
0
9
0
Order By: Relevance
“…Illium et al. [60] explore a method that tries using Mel-spectrograms as features representing audio, and then employ some data augmentation technique combined with a CNN in order to solve the classification task at hand. Many augmentation techniques and CNN architectures are explored, and the best combination is used.…”
Section: Challenge Results and Contributionsmentioning
confidence: 99%
See 3 more Smart Citations
“…Illium et al. [60] explore a method that tries using Mel-spectrograms as features representing audio, and then employ some data augmentation technique combined with a CNN in order to solve the classification task at hand. Many augmentation techniques and CNN architectures are explored, and the best combination is used.…”
Section: Challenge Results and Contributionsmentioning
confidence: 99%
“… [38] 71.8 16.5 40.0 Deep Spectrum , BoWA, ComParE , auDeep 19 Illium et al. [60] 71.5 34.7 22.3 CNNs with multiple augmentations …”
Section: Challenge Results and Contributionsmentioning
confidence: 99%
See 2 more Smart Citations
“…Note that our strategy takes a much less complicated pipeline for feature extraction compared with Ref. [18][19][20]. The contributions of this paper can be summarised as making use of low-level aggregation to achieve data augmentation and avoiding using external pre-trained models.…”
Section: Introductionmentioning
confidence: 99%