2018
DOI: 10.1007/s00521-018-3704-x
|View full text |Cite
|
Sign up to set email alerts
|

Basic filters for convolutional neural networks applied to music: Training or design?

Abstract: When convolutional neural networks are used to tackle learning problems based on music or other time series, raw one-dimensional data are commonly pre-processed to obtain spectrogram or mel-spectrogram coefficients, which are then used as input to the actual neural network. In this contribution, we investigate, both theoretically and experimentally, the influence of this pre-processing step on the network's performance and pose the question whether replacing it by applying adaptive or learned filters directly … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
12
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 21 publications
(12 citation statements)
references
References 28 publications
0
12
0
Order By: Relevance
“…Remark For Gabor multipliers c (ψ ⊗ ψ), Propositions 5.4 and 5.5 were proved in [14, Lem. 14], and have been used in the theory of convolutional neural networks [13].…”
Section: The Case C ∈ ∞ (3)mentioning
confidence: 99%
“…Remark For Gabor multipliers c (ψ ⊗ ψ), Propositions 5.4 and 5.5 were proved in [14, Lem. 14], and have been used in the theory of convolutional neural networks [13].…”
Section: The Case C ∈ ∞ (3)mentioning
confidence: 99%
“…Related principles of dimension reduction for other clinical classification problems in OCT have already been successfully applied in [9]. In the second experiment we aim to categorize musical instruments based on their spectrogram, see [18] for related results. Our utilized augmented target loss functions can increase the accuracy in both experiments.…”
Section: Introductionmentioning
confidence: 99%
“…The representation generalizes the first layer of the scattering transform [3]. Independently, a related transform was developed by Dörfler et al [4], which is equivalent to the mel spectrogram and therefore not frequencyuniform and pitch-invariant at the same time.…”
Section: Analysis Of Existing Representationsmentioning
confidence: 99%