2018
DOI: 10.1016/j.neucom.2017.07.021
|View full text |Cite
|
Sign up to set email alerts
|

Noise robust sound event classification with convolutional neural network

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
39
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 85 publications
(39 citation statements)
references
References 33 publications
0
39
0
Order By: Relevance
“…Next, the grayscale image is quantized into its RGB components; the mapping type is the in MATLAB r2016b [ 38 ]. The mapping is expressed as where is the RGB spectrogram image and is the non-linear quantization function [ 32 ]. It is worth noting that, to facilitate the observation and analysis of the RGB spectrogram image, we deploy the color mapping in this paper.…”
Section: Signal Model and Time–frequency Analysis Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Next, the grayscale image is quantized into its RGB components; the mapping type is the in MATLAB r2016b [ 38 ]. The mapping is expressed as where is the RGB spectrogram image and is the non-linear quantization function [ 32 ]. It is worth noting that, to facilitate the observation and analysis of the RGB spectrogram image, we deploy the color mapping in this paper.…”
Section: Signal Model and Time–frequency Analysis Methodsmentioning
confidence: 99%
“…First, the time–frequency analysis method based on the windowed short-time Fourier transform (STFT) [ 31 ] is employed to generate the spectrum of the MIMO-modulated signals. Then, the spectrum with different time windows is converted to a grayscale image, and this grayscale image is further transferred to the RGB spectrogram image [ 32 ]. Second, a fine-tuned AlexNet-based convolutional neural network (CNN) model is introduced to learn the features from the RGB spectrogram images.…”
Section: Introductionmentioning
confidence: 99%
“…CNN is a typical multi-layer neural network first proposed for computer vision problems and it is very suitable for image-related applications in machine learning. In recent years, the application of CNN to environment sound classification has been widely reported [22][23][24][25]. We compute STFT spectrogram in each sound frame and then feed it as an image input to the CNN classifier.…”
Section: Classical Machine Learningmentioning
confidence: 99%
“…Sound-event classification has seen a noticeable increase in interest as a field of research [1][2][3][4][5][6]. Supported by advancements in information communication technology (ICT) and convergence technology in the Industry 4.0 era [7], various industries are conducting research using sound data.…”
Section: Introductionmentioning
confidence: 99%