2018
DOI: 10.1007/978-3-030-00767-6_49
|View full text |Cite
|
Sign up to set email alerts
|

Environmental Sound Classification Based on Multi-temporal Resolution Convolutional Neural Network Combining with Multi-level Features

Abstract: Motivated by the fact that characteristics of different sound classes are highly diverse in different temporal scales and hierarchical levels, a novel deep convolutional neural network (CNN) architecture is proposed for the environmental sound classification task. This network architecture takes raw waveforms as input, and a set of separated parallel CNNs are utilized with different convolutional filter sizes and strides, in order to learn feature representations with multi-temporal resolutions. On the other h… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 23 publications
(13 citation statements)
references
References 24 publications
0
13
0
Order By: Relevance
“…The best results were produced when Gammatone cepstral coefficients were combined TEO-GTSC with score-level fusion. A multi-temporal resolution CNN was proposed in [34]. Here, multiple CNNs with different filter sizes and stride lengths work on a raw audio signal on different temporal resolutions, in parallel.…”
Section: Related Workmentioning
confidence: 99%
“…The best results were produced when Gammatone cepstral coefficients were combined TEO-GTSC with score-level fusion. A multi-temporal resolution CNN was proposed in [34]. Here, multiple CNNs with different filter sizes and stride lengths work on a raw audio signal on different temporal resolutions, in parallel.…”
Section: Related Workmentioning
confidence: 99%
“…[12] proposed a timedomain network based on the 1D raw waveform. The results in [13,14] demonstrated that the feature directly learned from the raw waveform is sufficient for audio classification problems. However, these works are designed specially for single-label audio classification.…”
Section: Related Workmentioning
confidence: 99%
“…Considering the sound processing field, few studies have been conducted in this field, especially classifying and grading fluids sounds, except the analysis and recognition of human voices, which is a highly active area and works such as speech [34][35][36][37][38][39][40] recognition have been performed in this regard. However, studies have been recently conducted on the classification of fluids sounds in other areas, including human heartbeat [41], urban sounds [42][43][44][45][46][47][48] play sounds, car horns, air conditioning sound, engine sounds, etc. ), and music [49][50][51][52][53][54][55][56], using deep learning.…”
Section: Fig1 Venn Diagram Of Artificial Intelligencementioning
confidence: 99%