2006
DOI: 10.1007/0-387-32845-9
|View full text |Cite
|
Sign up to set email alerts
|

Signal Processing Methods for Music Transcription

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
13
0

Year Published

2013
2013
2022
2022

Publication Types

Select...
4
3
3

Relationship

0
10

Authors

Journals

citations
Cited by 257 publications
(13 citation statements)
references
References 336 publications
(807 reference statements)
0
13
0
Order By: Relevance
“…Spectral Bandwidth , as defined in Klapuri and Davy (2007) , which is a weighted standard deviation of the spectrum in a given audio segment:…”
Section: Methodsmentioning
confidence: 99%
“…Spectral Bandwidth , as defined in Klapuri and Davy (2007) , which is a weighted standard deviation of the spectrum in a given audio segment:…”
Section: Methodsmentioning
confidence: 99%
“…As a result, 98% of the pairs of stimuli presented in the four-sounds BWS trials had a difference equal to or greater than the spectral centroid JND. The spectral centroid of each sound was computed and averaged for each sound with the Librosa library (Klapuri and Davy, 2007). The loudness of each sound sample was equalized following the European Broadcast Union (EBU) norm on loudness (R-128) with the ffmpeg library (Python Package Index).…”
Section: Stimulimentioning
confidence: 99%
“…Audio feature extraction was performed using the Python module "librosa" (McFee et al, 2015). The features were: Mel-frequency cepstral coeffi-cients (MFCCs), which are widely used features for characterising and detecting voice signals (Klapuri and Davy, 2006); several spectral features like spectral centroids (Klapuri and Davy, 2006), spectral bandwidth (Klapuri and Davy, 2006), spectral roll-off (McFee et al, 2015) and spectral contrast (Jiang et al, 2002); and a 12-bit chroma vector (McFee et al, 2015). For each sentence, we used the timestamp to clip the audio file with a buffer of ±2 s to ensure the full audio of the utterance was captured.…”
Section: Argumentative Relation Classificationmentioning
confidence: 99%