2022
DOI: 10.48550/arxiv.2210.08610
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Robust, General, and Low Complexity Acoustic Scene Classification Systems and An Effective Visualization for Presenting a Sound Scene Context

Abstract: In this paper, we present a comprehensive analysis of Acoustic Scene Classification (ASC), the task of identifying the scene of an audio recording from its acoustic signature.In particular, we firstly propose an inception-based and lowfootprint ASC model, referred to as the ASC baseline. The proposed ASC baseline is then compared with benchmark and high-complexity network architectures of MobileNetV1,

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 24 publications
0
2
0
Order By: Relevance
“…Regarding the teacher architecture, it comprises two main parts: a CNN-based backbone followed by a dense block. The CNN-based backbone, which presents a residual-inception based architecture, is reused from [21], [10]. The dense block comprises two dense layers (Dense Layer 01 and Dense Layer 02), which is shown in the lower part of Fig.…”
Section: The Teacher Network Architecturementioning
confidence: 99%
“…Regarding the teacher architecture, it comprises two main parts: a CNN-based backbone followed by a dense block. The CNN-based backbone, which presents a residual-inception based architecture, is reused from [21], [10]. The dense block comprises two dense layers (Dense Layer 01 and Dense Layer 02), which is shown in the lower part of Fig.…”
Section: The Teacher Network Architecturementioning
confidence: 99%
“…Regarding the residual-inception based network for training audio spectrograms, it is separated into two main parts: A Residual-Inception block and a Dense block. The Residual-Inception block in this paper is the CNN-based backbone of the novel residual-inception deep neural network architecture which is reused from our previous works in [15]. Meanwhile, with RGB format.…”
Section: A Phase I: Train Deep Learning Models On Individual Audio Or...mentioning
confidence: 99%