Proceedings of the 13th International Conference on Agents and Artificial Intelligence 2021
DOI: 10.5220/0010185800490056
|View full text |Cite
|
Sign up to set email alerts
|

Acoustic Anomaly Detection for Machine Sounds based on Image Transfer Learning

Abstract: In industrial applications, the early detection of malfunctioning factory machinery is crucial. In this paper, we consider acoustic malfunction detection via transfer learning. Contrary to the majority of current approaches which are based on deep autoencoders, we propose to extract features using neural networks that were pretrained on the task of image classification. We then use these features to train a variety of anomaly detection models and show that this improves results compared to convolutional autoen… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 33 publications
(6 citation statements)
references
References 36 publications
0
6
0
Order By: Relevance
“…The model was pre-trained with the ImageNet dataset, instead (Deng et al, 2009). ImageNet pre-trained models have been successfully used to boost the performance of CNNs models in audio classification tasks in recent years (Gwardys and Grzywczak, 2014;Müller et al, 2020;Palanisamy et al, 2020;Zhong et al, 2020;Gong et al, 2021).…”
Section: Deep Learning Algorithmsmentioning
confidence: 99%
“…The model was pre-trained with the ImageNet dataset, instead (Deng et al, 2009). ImageNet pre-trained models have been successfully used to boost the performance of CNNs models in audio classification tasks in recent years (Gwardys and Grzywczak, 2014;Müller et al, 2020;Palanisamy et al, 2020;Zhong et al, 2020;Gong et al, 2021).…”
Section: Deep Learning Algorithmsmentioning
confidence: 99%
“…An embedded spatial similarity measure using a neural network and attention mechanisms that absorb time-frequency stretching are the two main components of SPIDERnet's detection system for anomalous sounds. Instead of utilizing deep AEs, Müller et al [5] suggest neural networks that are pre-trained on the image classification task to extract features. Anomaly detection models are subsequently trained using these characteristics.…”
Section: Related Workmentioning
confidence: 99%
“…Muller et al [ 3 ] proposed the use of ImageNet pre-trained CNNs (e.g., ResNet-18 [ 12 ] and AlexNet [ 13 ]) for automatic feature extraction from Mel-spectrograms. The extracted features are subsequently fed to traditional machine learning methods such as Gaussian Mixture Models and Support Vector Machines for inference.…”
Section: Related Workmentioning
confidence: 99%
“…A variety of deep learning-driven techniques have been introduced for acoustic anomaly detection (AAD) in recent years, including dense autoencoders [ 1 , 2 ], convolutional autoencoders [ 2 ], and pre-trained convolutional neural networks [ 3 ]. Although deep learning-driven methods have demonstrated excellent accuracy in detecting anomalous sounds, the widespread adoption of these methods remains limited.…”
Section: Introductionmentioning
confidence: 99%