2021
DOI: 10.3390/app11114880
|View full text |Cite
|
Sign up to set email alerts
|

A Study of Features and Deep Neural Network Architectures and Hyper-Parameters for Domestic Audio Classification

Abstract: Recent methodologies for audio classification frequently involve cepstral and spectral features, applied to single channel recordings of acoustic scenes and events. Further, the concept of transfer learning has been widely used over the years, and has proven to provide an efficient alternative to training neural networks from scratch. The lower time and resource requirements when using pre-trained models allows for more versatility in developing system classification approaches. However, information on classif… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
2

Relationship

1
8

Authors

Journals

citations
Cited by 14 publications
(6 citation statements)
references
References 49 publications
0
3
0
Order By: Relevance
“…DAG networks are characterized by their complex architecture, which enables the utilization of the output from one layer as input for multiple other layers. Although this type of architecture is more complex and limits the overall flexibility of the neural network for future modifications, it offers advantages regarding downsizing the overall model size upon export [40]. Another type of neural network architecture is the series architecture, which involves a one-input-one-output mechanism.…”
Section: Pre-trained Neural Network Modelsmentioning
confidence: 99%
See 1 more Smart Citation
“…DAG networks are characterized by their complex architecture, which enables the utilization of the output from one layer as input for multiple other layers. Although this type of architecture is more complex and limits the overall flexibility of the neural network for future modifications, it offers advantages regarding downsizing the overall model size upon export [40]. Another type of neural network architecture is the series architecture, which involves a one-input-one-output mechanism.…”
Section: Pre-trained Neural Network Modelsmentioning
confidence: 99%
“…Due to the simpler connectivity, this type of architecture often results in a larger model. Nonetheless, it provides advantages in terms of more flexibility, particularly for hyper-parameter variations and model fine-tuning applications [40].…”
Section: Pre-trained Neural Network Modelsmentioning
confidence: 99%
“…The eighth paper (Copiaco et al ( 2021)) [8] presents a detailed study of the most apparent and widely-used cepstral and spectral features for multi-channel audio applications. Additionally, the paper details the development of a compact version of the AlexNet model for computationally limited platforms through studies of performances against various architectural and parameter modifications of the original network.…”
Section: Published Papersmentioning
confidence: 99%
“…This model showcases the effectiveness of deep CNNs for image classification tasks, setting a new standard for supervised learning without unsupervised pre-training. Several studies have used AlexNet in the audio classification domain using spectrograms with state-of-the-art performance [ 23 , 24 , 25 ].…”
Section: Introductionmentioning
confidence: 99%