2016
DOI: 10.15439/2016f558
|View full text |Cite
|
Sign up to set email alerts
|

Analysis of time-frequency representations for musical onset detection with convolutional neural network.

Abstract: Abstract-In this paper a convolutional neural network is applied to the problem of note onset detection in audio recordings. Two time-frequency representations are analysed, showing the superiority of standard spectrogram over enhanced autocorrelation (EAC) used as the input to the convolutional network. Experimental evaluation is based on a dataset containing 10,939 annotated onsets, with total duration of the audio recordings of over 45 min.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 7 publications
(2 citation statements)
references
References 18 publications
0
2
0
Order By: Relevance
“…CNNs are used for note onset detection in audio recordings in the early work [83] for sound event recognition. The use of a spectrogram as an input to the network instead of the enhanced auto-correlation yields better detection performance.…”
Section: Convolutional Neural Network (Cnn)mentioning
confidence: 99%
“…CNNs are used for note onset detection in audio recordings in the early work [83] for sound event recognition. The use of a spectrogram as an input to the network instead of the enhanced auto-correlation yields better detection performance.…”
Section: Convolutional Neural Network (Cnn)mentioning
confidence: 99%
“…The CNN is one of the most popular architectures in many music-related machine learning tasks [16,17,20,25,[44][45][46][47][48][49][50][51][52][53][54][55]. Many of these works adopt an architecture having cascading blocks of 2-dimensional filters and max-pooling, derived from well-known works in image recognition [21,56].…”
Section: Base Architecturementioning
confidence: 99%